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(57) Abstract 

The present invention relates to isolated and/or recombinant nucleic acids which encode a mammalian (e.g., human, mouse) uncoupling 
protein 3 (UCP3) and an alternative form of UCP3 designated UCP3-short form (UCP3sh). In addition, the present invention relates to 
nucleic acids which hybridize with the UCP3 nucleic acids described herein and functional portions thereof. Also encompassed by the 
invention are a nucleic acid construct comprising a nucleic acid which encodes a UCP3 protein and a host cell; a host cell comprising 
the nucleic acid construct which encodes UCP3; and a method for producing mammalian UCP3 comprising introducing into a host cell 
the nucleic acid construct which encodes UCP3 whereby the nucleic acid is expressed. The present invention also relates to isolated or 
recombinantly produced UCP3 protein and functional portions thereof. Also encompassed by the invention are a method of identifying 
an inhibitor (e.g., antibody) or enhancer of UCP3 expression and/or function, and the use of UCP3 inhibitors and. enhancers. The present 
invention also relates to a method of detecting UCP3 in a sample obtained from a individual. 
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UCP3 : AN UNCOUPLING PROTEIN HOMOLOGUE 

RELATED APPLICATIONS 

This application is a Continuation-in-Part of 
Application No. 08/892,745 entitled "UCP3 : An Uncoupling 
5 Protein Homologue Expressed Selectively and Abundantly in 
Skeletal Muscle and Brown Adipose Tissue" filed July 15, 
1997 and claims benefit of U.S. provisional application 
number 60/046 , 254 entitled "Discovery of an Alternative 
Form of UCP3 , Designated UCP3-Short Form (UCPsh) » filed May 

10 12, 1997 and U.S. provisional application number 

60/043,447, entitled "An Uncoupling Protein Homologue 
Expressed Selectively and Abundantly in Skeletal Muscle and 
Brown Adipose Tissue" , filed April 9, 1997. The teachings 
of Application No. 08/892,745, U.S. provisional application 

15 number 60/043,447 and U.S. provisional application number 
60/046,254 are incorporated herein by reference in their 
entirety. 

GOVERNMENT FUNDING 

This work was supported in part by the National 
20 Institutes of Health Grants DK02119 and DK49569. 

Therefore, the U.S. Government has certain rights in the 
invention. 

BACKGROUND 

Calories are expended by mitochondria in a highly 
2 5 regulated fashion. Oxidation of fuels by the electron 

, transport chain generates a proton electrochemical gradient 
across the inner mitochondrial membrane. Re-entry of 
protons via ATP synthesis drives conversion of ADP to ATP. 
Uncoupling proteins (UCPs) are inner mitochondrial membrane 
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transporters which dissipate the proton gradient, releasing 
stored energy as heat (Nicholls, D.G., et al . , Physiol. 
Rev., 64:1-64 (1984); Klingenberg, M . , et al . , TrnedS 
Biochem. Sci., 15:108-112 (1990)). For this reason, UCPs 
5 are potentially important determinants of metabolic 
efficiency. UCP1, the first uncoupling protein to be 
identified (Lin, C.S., et al . , FEBS Lett. , 113:299-303 
(1980); Jacobsson, A., et al . , J . Biol. CJiem. , 260:16250- 
16254 (1985); Bouillaud, F., et al . , J. Biol. Chem. , 
10 251:1487-1490 (1986)), is expressed exclusively in brown 

adipose tissue, an important site of energy expenditure in 
rodents (Himms-Hagen, J., Prog. Lipid Res., 28:67-115 
(1989)) . However, UCP1 may be of lesser importance in 
humans, in whom the amount of brown adipose tissue is 
15 limited. A second uncoupling protein, referred to UCP2 , 
was recently identified (Fleury, C, et al., Nature 
Genetics, 15:269-272 (1997)) or UCPH (Gimeno, R.E., et al . , 
Diabetes, 46:900-906 (1997)). In contrast with UCP1, UCP2 
is expressed in many tissues, including sites not thought 
2 0 to mediate energy expenditure which occurs in response to 

environmental temperature ' or diet (adaptive thermogenesis) . 

A greater understanding of the genes involved in 
metabolism will provide new approaches and targets for 
regulating energy expenditure in mammals. 

25 SUMMARY OF THE INVENTION 

The present invention relates to an uncoupling protein 
(UCP3) gene which is selectively expressed in skeletal 
muscle and brown fat, two tissues involved in energy 
expenditure in mammals. In addition, the invention relates 

30 to an alternative form of UCP3 designated UCP3-sftort form 
(UCP3sh) , which is also expressed in skeletal muscle. 
Skeletal muscle particularly has a capacity for energy 
expenditure, or adaptive thermogenesis, in humans . 
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As used herein, U UCP3" refers to UCP3 and UCP3sh. In 
particular, the present invention relates to isolated 
(e.g., purified, essentially pure) nucleic acids 
(oligonucleotides, nucleotide sequences) which encode a 
5 mammalian (e.g., human) UCP3 protein, and include for 
example, nucleic acids (DNA, RNA) obtained from natural 
sources, recombinantly produced or chemically synthesized. 
The nucleic acids of the present invention include nucleic 
acid? encoding human UCP3 (SEQ ID NO: 1), human UCP3sh (SEQ 

10 ID NO: 2), mouse UCP3 (SEQ ID NO: 7) and characteristic 
portions thereof (e.g., probes, primers). The invention 
also includes complementary sequences (i.e., a complement) 
of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 7 and \ 
characteristic portions thereof. The nucleic acids of the 

15 present invention encompass nucleic acids encoding a human 
UCP3 amino acid sequence (SEQ ID NO: 3) , a human UCP3sh 
form amino acid sequence (SEQ ID NO: 4) , a mouse UCP3 amino 
acid sequence (SEQ ID NO: 8) and characteristic portions 
thereof . 

20 ' The present invention further relates to isolated, 

recombinantly produced or synthetic nucleic acids which 
hybridize to the nucleic acids described herein (e.g., SEQ 
ID NO: 1, SEQ ID NO : 2, SEQ ID NO : 7 or characteristic 
portions thereof) and encode UCP3 protein (a protein having 

25 the same amino acid sequence as the amino acid sequences 
included herein and/or a protein which exhibits the same 
characteristics as the UCP3 protein described herein) . In 
particular, the invention relates to nucleic acids which 
hybridize, under moderate or high stringency, conditions, to 

3 0 SEQ ID NO: 1, SEQ ID NO: 2 f SEQ ID NO: 7, characteristic 
portions thereof or other sequences which encode. UCP3 . 

Also encompassed by the present invention is a nucleic 
acid construct comprising nucleic acid which encodes a UCP3 
protein (e.g., SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO : 7 and 
3 5 characteristic portions thereof) , wherein the nucleic acid 
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of the construct is expressed when the construct is present 
in an appropriate host cell. In one embodiment, the 
nucleic acid construct of the present invention is operably 
linked to exogenous regulatory sequence (s) such as a 
5 promoter and/or enhancer, whereby mammalian UCP3 is 

expressed when the host cell is maintained under conditions 
suitable for expression. The present invention also 
relates to a host cell comprising nucleic acid encoding 
mammalian UCP3 protein. 
!0 Also encompassed by the present invention is a method 

for producing a mammalian UCP3 protein (human) ♦ In the 
method, a nucleic acid construct comprising a nucleotide 
sequence (DNA, RNA) which encodes a mammalian UCP3 protein 
is introduced into a host cell, resulting in production of 
15 a recombinant host cell which contains a UCP3 coding 
sequence operably linked to an (i.e., at least one) 
expression control sequence. The host cells produced are 
maintained in a suitable medium under conditions 
appropriate for the nucleotide sequence to be expressed, 
2 0 whereby the encoded UCP3 is produced. 

The present invention also relates to isolated (e.g., 
purified, essentially pure) UCP3 protein and includes, for 
example, UCP3 protein obtained from natural sources, 
recombinantly produced or chemically synthesized. For 
25 example, the UCP3 protein can be human UCP3 protein (SEQ ID 
NO: 3) , human UCP3sh (SEQ ID NO : 4 ) , mouse UCP3 protein (SEQ 
ID NO: 8) or functional portions thereof. 

The present invention also pertains to a method of 
identifying agents which modulate or alter (e.g., inhibit 
30 or enhance) UCP3 activity. An inhibitor of UCP3 interferes 
(partially or completely) with the function or bioactivity 
of UCP3 , directly or indirectly. An enhancer (activator) 
of UCP3 increases or enhances the function or bioactivity 
of UCP3 , directly or indirectly. 
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In one embodiment, the present invention relates to a 
method of identifying an agent which alters UCP3 activity, 
wherein a nucleic acid construct comprising nucleic acid 
which encodes a mammalian UCP3 is introduced into a host 
5 cell (s) . The host cells produced are maintained under 
conditions appropriate for expression of the encoded 
mammalian UCP3,, whereby the nucleic acid is expreased. The 
host cells are then contacted with a compound to be 
assessed (an agent) and the mitochondrial electrical 

10 potential of the cells is detected in the presence of the 
compound to be assessed. Detection of a change in 
mitochondrial electrical potential in the presence of the 
agent indicates that the agent alters UCP3 activity. In a 
particular embodiment, the invention relates to a method of 

15 identifying an agent which is an activator of UCP3 activity 
wherein a nucleic acid construct comprising nucleic acid 
which encodes a mammalian UCP3 is introduced into a host 
cell (s) . The host cells produced are maintained under 
conditions appropriate for expression of the encoded 

2 0 mammalian UCP3 , whereby the nucleic acid is expressed. The 

host cells are then contacted with a compound to be 
assessed (an agent) and the mitochondrial electrical 
potential of the cells is detected in the presence of the 
compound- to be assessed. Detection of a decrease or 
25 reduction of mitochondrial electrical potential ■ in the 
presence of the agent indicates that the agent activates 
UCP3 activity. In another embodiment, the invention 
relates to a method of identifying an agent which is an 
inhibitor of UCP3 activity, wherein a nucleic acid 

3 0 construct comprising nucleic acid which encodes a mammalian 

UCP3 is introduced into a host cell (s) . The host cells 
produced are maintained under conditions appropriate for 
expression of the encoded mammalian UCP3 , whereby the 
nucleic acid is expressed. The host cells are then 
3 5 contacted with a compound to be assessed (an agent) and the 
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. mitochondrial electrical potential of the cells is detected 

in the presence of the compound to be assessed. Detection 
' of an increase of mitochondrial electrical potential ,in the 
presence of the agent indicates that the agent inhibits 
5 UCP3 activity. Methods of identifying agents which alter 
UCP3 activity can also be performed, as described herein, 
using a mixture of a membrane fraction, mitochondria and 
UCP3 (Jezek, etal., J. Biol. Chem. 271:6199-6205 (1996)). 
Also encompassed by the present invention is an agent 
10 which interacts with UCP3 directly or indirectly, and 

inhibits or enhances UCP3 function. In one embodiment, the 
agent is an inhibitor which interferes with UCP3 directly 
(e.g., by binding UCP3) or indirectly (e.g., by blocking 
the ability of UCP3 to regulate thermogenesis in skeletal 
15 muscle and/or brown adipose tissue) . In a particular 

embodiment, an inhibitor of the UCP3 protein is an antibody 
specific for UCP3 protein or a portion of a UCP3 protein; 
that is, the antibody binds the UCP3 protein. For example, 
the antibody can be specific for the human UCP3 protein 
2 0 (SEQ ID NO: 3, SEQ ID NO: 4) , the mouse UCP3 protein (SEQ 
ID NO: 8) or functional portions thereof. Alternatively, 
the inhibitor can be an agent other than an antibody (e.g., 
small organic molecule, protein, peptide) which binds UCP3 
and blocks its activity. Furthermore, the inhibitor can be 
25 an agent which mimics UCP3 structurally but lacks its 

function. Alternatively, the inhibitor of UCP3 can be an 
agent which binds to or interacts with a molecule which 
UCP3 normally binds with or interacts with, thus blocking 
UCP3 from doing so and preventing it from exerting the 
30 effects it would normally exert. In another embodiment, 
the agent is an enhancer of UCP3 which increases the 
activity of UCP3 (increases thermogenesis in skeletal 
muscle and/or brown adipose tissue) , increases the length 
of time it is effective (by preventing its degradation or 



WO 98/45438 



PCTYUS98/06959 



otherwise prolonging the time during which it. is active) or 
both, either directly or indirectly. 

The present invention also relates to antibodies 
(monoclonal or polyclonal) or functional portions thereof 
5 (e.g., an antigen binding portion such as an Fv, Fab, Fab', 
or F(ab') 2 fragment) which bind mammalian UCP3 . 

Isolation of UCP3 makes it possible to detect UCP3 in 
a sample (e.g., test sample) . The present invention also 
relates to a method of detecting mammalian UCP3 in a sample 
10 (e.g., skeletal muscle, brown adipose tissue) obtained 
from an individual, such as a human. In one embodiment, 
the sample is treated to render nucleic acids in the sample 
available for hybridization to a nucleic acid probe (e.g., 
SEQ ID NO; 1, SEQ ID NO : 2, SEQ ID NO : 7 and/or 
15 characteristic portions thereof which bind to 

characteristic regions of UCP3 -encoding nucleic acids) . 
The treated sample is combined with a nucleic acid probe 
(labeled or unlabeled) comprising or complementary to all 
or a characteristic portion of the nucleotide sequence 

2 0 encoding UCP3 protein, under conditions appropriate for 

hybridization of complementary nucleic acids to occur. 
Hybridization of nucleic acids in the treated sample with 
the nucleic acid probe is detected; the occurrence of 
hybridization indicates the presence of UCP3 protein in the 
25 sample. In another embodiment, the sample is contacted 

with an antibody which binds to UCP3 protein (e.g., SEQ ID 
NO: 3, SEQ ID NO : 4, SEQ ID NO : 8 or functional portions 
thereof) under conditions suitable for binding of the 
antibody to the mammalian UCP3 . Binding of the antibody to 

3 0 a component of the sample is detected; binding of the 

antibody to a component of the sample indicates the 
presence of UCP3 protein in the sample. 

Isolation of UCP3 also makes it possible to identify a 
promoter (s) and/or enhancer (s) of the UCP3 gene. 



WO 98745438 PCT/US98/06959 



Identification of promoters . and/or enhancers of the UCP3 
gene allow for identification of regulators of UCP3 
transcription. 

In addition, the present invention relates to 
5 transgenic non human animals (e.g., mice) which lack the 
UCP3 gene or contain a nonfunctional UCP3 gene such that 
UCP3 activity is lacking (e.g., UCP3 knockout mouse). The 
invention also relates to methods of producing UCP3 gene 
knockout animals, such as mice. UCP3 knockout mice can be 
10 used to further study the UCP3 gene and to assay for 
inhibitors and enhancers of UCP3 . 

The present invention also relates to a method of 
inhibiting (partially, completely) protein catabcilism in a 
mammal (e.g., human) comprising administering to the mammal 
15 an effective amount of an inhibitor of UCP3 . The invention 
also relates to a method of enhancing protein catabolism in 
a mammal comprising administering to the mammal an 
effective amount of an enhancer of UCP3 . Also encompassed 
by the present invention is a method of inhibiting muscle 
20 wasting in a mammal comprising administering an effective 
amount of an inhibitor of UCP3 to the mammal . 

.Discovery of the UCP3 gene provides for selective 
modulation (enhancement, inhibition) of the expression 
and/or function of the UCP3 gene in skeletal muscle and 
25 brown fat, two tissues involved in adaptive thermogenesis . 

BRIEF DESCRIPTION OF THE FIGURES 

Figures 1A-1C are the nucleotide sequence of human 
UCP3 (SEQ ID NO: 1) and three different amino acid 
sequences (SEQ ID NO: 27, SEQ ID NO: 28 and SEQ ID NO: 29) 
3 0 translated from SEQ ID NO: 1. 

Figures 2A-2B are the nucleotide sequence of the UCP3- 
short form (UCP3sh) gene {SEQ ID NO: 2) and three different 
amino acid sequences (SEQ ID NO: 30, SEQ ID. NO: 31 and SEQ 
ID NO: 32) translated from SEQ ID NO: 2. 
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Figure 3 is a comparison of the human UCP3 amino acid 
sequence (SEQ ID NO: 3) , the human UCP3sh amino acid 
sequence (SEQ ID NO: 4) , the human UCP1 amino acid sequence 
(SEQ ID NO: 5) and the human UCP2 amino acid sequence (SEQ 
5 ID NO: 6) ; sequence alignments were performed using the 
ALIGN program (Myers, E.W., and Miller, W. , Computer Appl . 
Biosci. 4:11-17 (1988); and the Genbank accession numbers 
for hUCPl, hUCP2 and hUCP3 are U28480, U94592 and AF001787, 
respectively. 

10 Figure 4 is a graph of the hydrophilicity plots of 

human UCP2 and human UCP3 showing the hydrophobicity of 
protein across linear sequence; hydrophilicity plots for 
hUCP2 and hUCP3 were generated using the methods \of Kyte 
and Doolittle (Kyte, J. and Doolittle, R.F., J". Mol . Biol. 

15 157:105-132 (1982)). 

Figures 5A-5C are the nucleotide sequence of mouse 
UCP3 (SEQ ID NO: 7) and three different amino acid 
sequences (SEQ ID NO: 33, SEQ ID NO: 34, SEQ ID NO: 35) 
translated from SEQ ID NO: 7. 

2 0 Figure 6 is the amino acid sequence of mouse UCP3 (SEQ 

ID NO: 8) . 

.Figure 7 is a comparison of the mouse UCP3 amino acid 
sequence (SEQ ID NO: 8) with the mouse UCP1 amino acid 
sequence (SEQ ID NO: 9) , the mouse UCP2 amino acid sequence 

2 5 (SEQ ID NO: 10) and the human UCP3 amino acid sequence (SEQ 

ID NO: 3); the attached sequence and amino acid alignments, 
mUCP3 is 46% identical to mUCPl , 62% identical to mUCP2 but 
is 82% identical to hUCP3 . 

Figure 8 is a graphic- representation of the genomic 

3 0 ' organization of the human UCP3 gene, and shows the splice 

donor sequence (SEQ ID NO: 11) and splice acceptor sequence 
i (SEQ ID NO: 12) between exons 1 and 2, the splice donor 
sequence (SEQ ID NO: 13) and splice acceptor sequence (SEQ 
ID NO: 14) between exons 2 and 3, the splice donor sequence 
35 (SEQ ID NO: 15) and splice acceptor sequence (SEQ ID NO: 
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16) between exons 3 and 4, the splice donor sequence (SEQ 
ID NO: 17) and splice acceptor sequence (SEQ ID NO: 18) 
between exons 4 and 5, the splice donor sequence (SEQ ID 
NO: 19) and splice acceptor sequence (SEQ ID NO: 20) 
5 between exons 5 and 6, and the splice donor sequence (SEQ 
ID NO: 21) and splice acceptor sequence (SEQ ID NO: 22) 
between exons 6 and 7 of the UCP3 gene. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to an uncoupling protein 
10 (UCP3) gene which is selectively expressed in skeletal 
muscle and brown fat, two tissues involved in energy- 
expenditure in mammals. In addition, the invention relates 
to an alternative form of UCP3 designated UCP3- short form 
(UCP3sh) , which is also expressed in skeletal muscle. As 
15 used herein, "UCP3 H refers to UCP3 and UCP3sh. 

The present invention relates to isolated (e.g., 
purified, essentially pure) UCP3 gene which is involved in 
regulation of thermogenesis (energy expenditure) in 
mammals. In particular, the present invention relates to 
20 nucleic acids (e.g., DNA, RNA, oligonucleotides, 

polynucleotides) or characteristic portions thereof as 
described herein, obtained, from natural sources, 
recombinantly produced or chemically synthesized which 
encode a mammalian UCP3 or functional portion thereof. 
25 Nucleic acids referred to herein as "isolated" are 

.nucleic acids substantially free of (separated away from) 
the nucleic acids of the genomic DNA or cellular RNA of 
their biological source of origin (e.g., as it exists in 
cells or in a mixture of nucleic acids such as a library) , 
3 0 and may have undergone further processing. "Isolated" 
nucleic acids include nucleic acids obtained by methods 
described herein, similar methods or other suitable 
methods, including essentially pure nucleic acids, nucleic 
acids produced by chemical synthesis or by combinations of 
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biological and chemical methods, and recombinantly produced 
nucleic acids which are isolated (see e.g., Daugherty, B.L. 
et al., Nucleic Acids Res., 19 (9) ;2471-2476 (1991); Lewis, 
A. P. and J.S. Crowe, Gene, 101: 297-302 (1991)). Nucleic 
5 acids referred to herein as "recombinant" are nucleic acids 
which have been produced by recombinant DNA methodologies 
(recombinantly produced) . Recombinant DNA methodologies 
include, for example, expression of UCP3 in a host cell 
containing or modified to contain DNA or RNA encoding UCP3 
10 or expression of UCP3 using polymerase chain reaction (PCR) 
techniques . 

This invention includes characteristic portions of the 
nucleic acids described herein. As used herein, a 
"characteristic portion" of nucleic acids described herein 
15 refers to portions of a nucleotide sequence which encode a 
protein or polypeptide having at least one property, 
function or activity characteristic of UCP3 protein (e.g., 
predominantly expressed in brown adipose tissue and 
skeletal muscle; activity in regulating thermogenesis in 

2 0 skeletal muscle and brown adipose tissue; selectively 

uncoupling mitochondrial respiration in brown adipocytes 
and skeletal muscle). In addition, the term includes a 
nucleotide sequence which, through the degeneracy of the 
genetic code, encodes the same peptide as a peptide whose 
25 sequence is presented herein (e.g., SEQ ID NO: 1, SEQ ID 
NO: 2, SEQ ID NO: 7) . The nucleic acids described herein 
may also contain a modification of the molecule such that 
the resulting gene product is sufficiently similar to that 
encoded by the unmodified sequence that it has essentially 

3 0 the same activity as the unmodified sequence. An example 

of such a modification would be a "silent" codon 
. substitution or an amino acid substitution, for instance, 
substitution of one codon encoding a hydrophobic amino acid 
.to another codon encoding the same hydrophobic amino acid 
35 or substitution of one acidic amino acid for another acidic 
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amino acid. See Ausubel, F.M., et al . , Current Protocols 
in Molecular Biology, Greene Publ . Assoc. and Wiley - 
Interscience 1989. 

In one embodiment, the nucleic acid or characteristic 
5 portion thereof encodes a protein or polypeptide having at 
least one property, activity or function characteristic of 
a mammalian UCP3 (as defined herein) , such as activity or 
function characteristic of a mammalian UCP3 (as defined 
herein) , such as activity in regulation of thermogenesis in 
10 skeletal muscle and brown adipose tissue. 

The present invention also relates more specifically 
to isolated nucleic acids or a characteristic portion 
thereof, which encode mammalian UCP3 or variants thereof. 

The invention relates to isolated nucleic acids that: 
15 (i) hybridize to (a) a nucleic acid encoding a 

mammalian UCP3 (e.g., human), such as a nucleic acid having 
a nucleotide sequence as set forth or substantially as set 
forth in Figures 1A-1C (SEQ ID NO:l), Figures 2A-2B (SEQ ID 
NO: 2) or Figures 5A-5C (SEQ ID NO: 7) ; (b) the complement 
20 of the sequences of (a) ; or (c) characteristic portions of 
either of the foregoing (e.g., a portion comprising the 
open reading frame) ; 

(2) encode a protein or polypeptide having at least 
one property, activity of, function characteristic of a UCP3 
25 protein (e.g., predominantly expressed in brown adipose 
tissue and skeletal muscle; activity in regulating 
thermogenesis in skeletal, muscle and brown adipose tissue; 
selectively uncoupling mitochondrial respiration in brown 
adipocytes and skeletal muscle ) 
3 0 (3) encode a polypeptide having the amino acid 

sequence of a mammalian UCP3 (e.g., SEQ ID NO: 3, SEQ ID 
NO: 4, SEQ ID NO: 7); or 

(4) have a combination of these characteristics. 
In- one embodiment, the nucleic acid shares at least 
35 about 75% nucleotide sequence similarity, and more 
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preferably, at least about 90% nucleotide sequence 
similarity, to the sequence shown in Figures 1A-1C (SEQ ID 
NO:l), Figures 2A-2B (SEQ ID NO: 2) or Figures 5A-5C (SEQ 
ID NO: 7) . 

5 Isolated nucleic acids meeting these criteria include 

nucleic acids having sequences identical to sequences of 
naturally occurring mammalian UCP3 or variants of the 
naturally occurring sequences which encode mammalian 
(human) UCP3 . Such variants include mutants differing by 

10 the addition, deletion or substitution of one or more 
residues, modified nucleic acids in which one or more 
residues are modified (e.g., DNA or RNA analogs), and 
mutants comprising one or more modified residues. 

Nucleic acids of the present invention may be RNA or 

15 DNA (e.g., cDNA, genomic DNA, and synthetic DNA). The DNA 
may be double -stranded or single-stranded and, if single 
stranded, may be the coding strand or non-coding (anti- 
sense) strand. The coding sequence which encodes the 
polypeptide may be identical to the coding sequence shown 

20 in Figures 1A-1C (SEQ ID NO:l), Figures 2A-2C (SEQ ID 

NO: 2) , Figures 5A-5C (SEQ ID NO: 7) or may be a different 
coding sequence which, as a result of the redundancy or 
degeneracy of the genetic code, encodes the same 
polypeptide as the polypeptide encoded by the DNA of 

25 Figures 1A-1C (SEQ ID NO:l), Figures 2A-2B (SEQ ID NO : 2 ) or 
Figures 5A-5C (SEQ ID NO: 7) . 

The nucleic acid (polynucleotide) which encodes a UCP3 
polypeptide encoded by the UCP3 cDNA may include: only the 
coding sequence of a polypeptide; the coding sequence for a 

3 0 polypeptide and additional coding sequence .such as a leader 
or secretory sequence; the coding sequence for a 
polypeptide (and optionally additional coding sequence) and 
non-coding sequence, such as introns or non-coding sequence 
5' and/or 3' of the coding sequence. 
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Nucleic acids of the present invention, including 
those which hybridize to a selected nucleic acid as 
described above, can be detected or isolated under high 
stringency conditions or moderate stringency conditions, 
5 for example. "High stringency conditions" and "moderate 
stringency conditions" for nucleic acid hybridizations are 
explained at pages 2.10.1-2.10.16 (see particularly 2.10.8- 
11) and pages 6.3.1-6 in Current Protocols in Molecular 
Biology (Ausubel, F.M. et al . , eds . , Vol. 1, Suppl . 26, 
10 1991) , the teachings of which are hereby incorporated by 

reference. Factors such as probe length, base composition, 
percent mismatch between the hybridizing sequences, 
temperature and ionic strength influence the stability of 
nucleic acid hybrids. Thus, high or moderate stringency 
15 conditions can be determined empirically, and depend in 
part upon the characteristics of the known nucleic acid 
(e.g., DNA) and the other nucleic acids to be assessed for 
hybridization thereto. 

Nucleic acids of the present invention that are 
20 characterized by their ability to hybridize (e.g., under 
high or moderate stringency conditions) to (a) a nucleic 
acid encoding a mammalian UCP3 (for example, the nucleic 
acid depicted in Figures 1A-1C (SEQ ID NO:l), Figures 2A-2B 
(SEQ ID NO:2), Figures 5A-5B (SEQ ID NO: 7) or 
25 characteristic portions thereof) ; (b) the complement of the 
nucleic acids of (a) ; or (c) a portion thereof, can also 
encode a protein or polypeptide having at least one 
property, activity or function characteristic of a 
mammalian UCP3 as defined herein, such as activity in 
3 0 regulation of thermogenesis in skeletal muscle and brown 

adipose tissue. In a preferred embodiment the nucleic acid 
encodes a polypeptide which retains substantially the same 
biological function or activity as the polypeptide encoded 
by the 'DNA of Figures 1A-1C (SEQ ID NO:l), or Figures 2A-2B 
35 (SEQ ID NO:2) or Figures 5A-5C (SEQ ID NO: 7) . 
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Nucleic acids of the present invention can be used in 
the production of proteins or polypeptides. For example, a 
nucleic acid (e.g., DNA) encoding a mammalian UCP3 can be 
incorporated into various constructs and vectors created 
5 for further manipulation of sequences or for production of 
the encoded polypeptide in suitable host cells as described 
above . 

A further embodiment of the invention is antisense 
nucleic acid, which is complementary , in whole or in part, 

10 to a UCP3 sense strand, and can hybridize with it. The 

antisense strand hybridizes to DNA, or its RNA counterpart 
(i.e., wherein T residues of the DNA are U residues in the 
RNA counterpart) . When introduced into a cell, atntisense 
nucleic acid hybridizes to and inhibits the expression of 

15 the sense strand. Antisense nucleic acids can be produced 
by standard techniques. 

In another embodiment, the antisense nucleic acid is 
wholly or partially complementary to and can hybridize with 
a target nucleic acid which encodes a mammalian UCP3 . For 

2 0 example, antisense nucleic acid can be complementary to a 
target nucleic acid having the sequence shown as the open 
reading frame in Figures 1A-1C (SEQ ID NO : 1) , Figures 2A-2B 
(SEQ ID N0;2), Figures 5A-5C (SEQ ID NO: 7) or to a portion 
thereof sufficient to allow hybridization. 

25 The nucleic acids can also be used as probes (e.g., 

for in situ hybridization) to assess regulation of 
thermogenesis in skeletal muscle and/or brown adipose 
tissue. The nucleic acids can also be used as probes to 
detect and/or isolate (e.g., by hybridization with RNA or 

30 DNA) polymorphic or allelic variants, for example, in a 
sample (e.g., skeletal muscle, brown adipocytes, .white 
.blood cells) obtained from a host (e.g., a human). 
Moreover, the presence or level of a particular variant in 
a sample (s) obtained from an individual, as compared with 

35 the presence or level in a sample (s) from normal 
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individuals, can be indicative of an association between 
abnormal regulation of thermogenesis (e.g. , obesity) and a 
particular variant, which in turn can be used in the 
diagnosis of the condition. 
5 The present invention also relates to isolated (e.g., 

pure, essentially pure) proteins or polypeptides designated 
mammalian UCP3 and variants of mammalian UCP3 . In a 
preferred embodiment, the isolated proteins of the present 
invention have at least one property, activity or function 
10 characteristic of a mammalian UCP3 (as defined herein) , 

such as activity in regulating (mediating) thermogenesis in 
skeletal muscle and brown adipose tissue or selectively 
uncoupling mitochondrial respiration in brown adipocytes 
and in skeletal muscle. 
15 Proteins or polypeptides referred to herein as 

"isolated" are proteins or polypeptides purified to a state 
beyond that in which they exist in mammalian cells. 
"Isolated" proteins or polypeptides include proteins or 
polypeptides obtained by methods described herein, similar 
20 methods or other suitable methods. They include 

essentially pure proteins or polypeptides, proteins or 
polypeptides produced by chemical synthesis (e.g., 
synthetic peptides) , or by combinations of biological and 
chemical methods, and recombinant proteins or polypeptides 
25 which are isolated. The proteins can be obtained in an 

isolated state of at least about 50 % by weight, preferably 
at least about 75 % by weight, and more preferably, in 
essentially pure form. Proteins or polypeptides referred 
to herein as "recombinant" are proteins or polypeptides 
30 produced by the expression of recombinant nucleic acids. 

As used herein, "mammalian UCP3" protein ref ( ers to 
naturally occurring or endogenous mammalian UCP3s, proteins 
having an amino acid sequence which is the same as that of 
a naturally occurring or endogenous corresponding mammalian 
35 UCP3 (e.g., recombinant proteins), and functional variants 
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of each of the foregoing (e.g., functional fragments and/or 
mutants produced via mutagenesis and/or recombinant 
techniques) . Accordingly, as defined herein, the term 
includes mammalian UCP3 , glycosylated or unglycosylated 
5 UCP3 , polymorphic or allelic variants, and other isoforms 
of mammalian UCP3 (e.g., produced by alternative splicing 
or other cellular processes), and functional fragments. 

Naturally occurring or endogenous mammalian UCP3s 
include wild type proteins such as mammalian UCP3 , 

10 polymorphic or allelic variants and other isoforms which 
occur naturally in mammals (e.g., primate, preferably 
human, murine, bovine) . Such proteins can be recovered 
from a source in which UCP3 is naturally produced, for 
example. These mammalian proteins have the same amino acid 

15 sequence as naturally occurring or endogenous corresponding 
mammalian UCP3 . 

"Functional variants" of mammalian UCP3 include 
functional' fragments, functional mutant proteins/ and/or 
functional fusion proteins. Generally, fragments or 

20 portions of mammalian UCP3 encompassed by the present 
invention include those having one or more amino acid 
deletions relative to the naturally occurring mammalian 
UCP3 protein (such as N- terminal, C- terminal or internal 
deletions) . Fragments or portions in which' only contiguous 

25 amino acids have been deleted or in which non-contiguous 
amino acids have been deleted relative to naturally 
occurring mammalian UCP3 are also encompassed by the 
invention. 

Generally, mutants or derivatives of mammalian UCP3 , 
3 0 encompassed by the present invention include natural or 
artificial .variants differing by the addition, deletion 
, and/or . substitution of one or more contiguous or 
non- contiguous amino acid residues, or modified 
polypeptides in which one or more residues is modified, and 
35 mutants comprising one or more modified residues. For 
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example, mutants can be natural or artificial variants of 
mammalian UCP3 which differ from naturally occurring UCP3 
by the addition, deletion and/or substitution of one or 
more contiguous or non- contiguous amino acid residues. 
5 A "functional fragment or portion", "functional 

mutant" and/or "functional fusion protein" of a mammalian 
UCP3 refers to an isolated protein or oligopeptide which 
has at least one property, activity or function 
characteristic of a mammalian UCP3 , such as activity in 
10 regulating (mediating) thermogenesis in skeletal muscle and 
brown adipose tissue or activity in selectively uncoupling 
mitochondrial respiration in brown adipocytes and in 

• skeletal muscle. 1 • 

Suitable fragments or. mutants can be identified by 
15 screening. For example, the N- terminal , C- terminal, or 

internal regions of the protein can be deleted in a step- 
wise fashion and the resulting protein or polypeptide can 
be screened using a suitable assay, for example, by 
measuring mitochondrial membrane potential in a host cell 
20 expressing UCP3 . Where the resulting protein displays 

activity in the assay, the resulting protein ("fragment") 

is functional. 

The invention also encompasses fusion proteins, 
comprising a mammalian UCP3 as a first moiety, linked to a 
25 second moiety not occurring in the mammalian UCP3 found in 
nature. Thus, the second moiety can be, for example, an 
amino acid, oligopeptide or polypeptide. The first moiety 
can be in' an N- terminal location, C- terminal location or 
internal location of the fusion protein. In one 
30 embodiment, the fusion protein comprises a mammalian UCP3 

or portion thereof as the first moiety, and a second moiety 
comprising an affinity ligand (e.g., an enzyme, an antigen, 
epitope. tag) joined to the first moiety. Optionally, the 
two components can be joined by a linker. 
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Examples of "human UCP3 " include proteins having an 
amino acid sequence as set forth or substantially as set 
forth in Figure 3 (SEQ ID NO: 3, SEQ ID NO: 4) and 
functional portions thereof. An example of "mouse UCP3 11 
5 includes a protein having an amino acid sequence as set 
forth or substantially set forth in Figure 6 (SEQ ID NO: 
8) . In preferred embodiments, a human UCP3 protein, a 
mouse UCP3 protein or a . variant thereof has an amino acid 
sequence which has at least about 75% identity, and more 
10 preferably"; at least about 90% identity, to the protein 

shown in Figure 3 (SEQ ID NO: 3, SEQ ID NO: 4) or Figure 6 
(SEQ ID NO: 8). 

Another aspect of the invention relates to a method of 
producing a human UCP3 or variant (e.g., portion) thereof. 
15 Recombinant protein can be. obtained, for example, by the 
expression of a recombinant DNA molecule encoding a 
mammalian UCP3 or variant thereof in a suitable host cell. 

Constructs suitable for the expression, of a mammalian 
UCP3 or variant thereof are also provided. The constructs 
2 0 can be introduced into a suitable host cell, and cells 
which express a recombinant mammalian UCP3 or variant 
thereof, can be produced and maintained in culture. Such 
cells are useful for a variety of purposes, and can be used 
in the production of protein for characterization, 
25 isolation and/or purification, (e.g., affinity 

purification), and as immunogens , for instance. Suitable 
host cells can be procaryotic, including bacterial cells 
such as E. coli, B. suJbtilis and or other suitable bacteria 
(e.g., Streptococci) or eucaryotic, such as fungal or yeast 
30 cells (e.g., Pichia pastoris, Aspergillus species, 

Saccharomyces cerevisiae, Schizosaccharomyces pombe, 
Neurospora crassa) , or other lower eucaryotic cells, and 
cells of higher eucaryotes such as those from insects 
(e.g., Sf9 insect cells) or mammals (e . g ., Chinese hamster 
35 ovary cells (CHO) , COS cells, HuT 78 cells, 293 cells) . 
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(See, e.g., Ausubel , F.M. et al . , eds . Current Protocols in 
Molecular Biology, Greene Publishing Associates and John 
Wiley Sc Sons Inc., (1993))-. 

Host cells which produce a recombinant mammalian UCP3 
5 or variants thereof can be produced as follows. For 
example, nucleic acid encoding all or part of the UCP3 
protein or a functional portion thereof can be inserted 
into a nucleic acid vector, e.g., a DNA vector, such as a 
plasmid, virus or other suitable replicon for expression. 
10 A variety of vectors is available, including vectors which 
are maintained in single copy or multiple copy, or which 
become integrated into the host cell chromosome. 

The transcriptional and/or translational signals of a 
mammalian UCP3 gene can be used to direct expression. 
15 Alternatively, suitable expression vectors for the 

expression of a nucleic acid encoding all or part of the 
desired protein are available. Suitable expression vectors 
can contain a number of components, including, but not 
limited to, one or more of the following: an origin of 
2 0 replication; a selectable marker gene; one or more 

expression control elements, such as a transcriptional 
control element (e.g., a promoter, an enhancer, 
terminator) , and/or one or more translation signals; a 
signal sequence or leader sequence for membrane targeting 
2 5 or secretion (of mammalian origin or from a heterologous 

mammal or non-mammalian species) . In a construct, a signal 
sequence can be provided by the vector, the mammalian UCP3 
coding sequence, or other source. 

A promoter can be provided for expression in a 
30 suitable host cell. Promoters can be constitutive or 

inducible. The promoter is operably linked to nucleic acid 
encoding the mammalian UCP3 or variant thereof, and is 
capable of directing expression of the encoded polypeptide 
in the 'host cell. A variety of suitable promoters for 
35 procaryotic (e.g., lac, tac, T3 , T7 promoters for E. coli) 
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and eucaryotic (e.g., yeast alcohol dehydrogenase (ADH1) , 
SV40, CMV) hosts is available. 

In addition, the expression vectors typically comprise 
a selectable marker for selection of host cells carrying 
5 the vector, and in the case of a replicable expression 
vector, also comprise an origin of replication. Genes 
encoding products which confer antibiotic or drug 
resistance are common selectable markers and may be used in 
procaryotic (e.g., j3-lactamase gene (ampicillin 

10 resistance) , Tet gene for tetracycline resistance) and 

eucaryotic cells (e.g., neomycin (G418 or geneticin) , gpt 
(mycophenolic acid) , ampicillin, or hygromycin resistance 
genes) . Dihydrof olate reductase marker genes permit 
selection with methotrexate in a variety of hosts. Genes 

15 encoding the gene product of auxotrophic markers of the 

host (e.g., LEU2, URA3 , HIS3) are often used as selectable 
markers in yeast. Use of viral (e.g., baculovirus) or 
phage vectors, and vectors which are capable of integrating 
into the genome of the host cell, such as retroviral 

2 0 vectors, are also contemplated. The present invention also 
relates to cells carrying these expression vectors. 

.For example, a nucleic acid encoding a mammalian UCP3 
or variant thereof is incorporated into a vector, operably 
linked to one or more expression control elements, and the 

25 construct is introduced into host cells which are 

maintained under conditions suitable for expression, 
whereby the encoded polypeptide is produced. The construct 
can be introduced into cells by a method appropriate to the 
host cell selected (e.g., transformation, transf ection, 

30 electroporation, infection) . For production of a protein, 
host cells comprising the construct are maintained under 
conditions appropriate for expression, (e.g., in the 
presence of inducer, suitable media supplemented with 
appropriate salts, growth factors, antibiotic, nutritional 
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supplements, etc.). The encoded protein (e.g., human UCP3) 
can be isolated from the host cells or medium. 

Fusion proteins can also be produced in this mariner. 
For example, some embodiments can be produced by the 
5 insertion of a mammalian UCP3 cDNA or portion thereof into 
a suitable expression vector, such as Bluescript®II SK +/- 
(Stratagene) , pGEX-4T-2 (Pharmacia) , pcDNA-3 (Invitrogen) 
and pET-15b (Novagen) . The resulting construct can then be 
introduced into a suitable host cell for expression. Upon 
10 expression, fusion protein can be isolated or purified from 
a cell lysate by means of a suitable affinity matrix (see 
e.g., Current Protocols in Molecular Biology (Ausubel , F.M. 
et al., eds., Vol. 2, Suppl . 26, pp. 16 . 4 . 1-16 . 7 .;8 (1991)). 
In addition, affinity labels, provide a means of detecting a 
15 fusion protein. For example, the cell surface expression 
or presence in a particular cell fraction of a fusion 
protein comprising an antigen or epitope affinity label can 
be detected by means of an appropriate antibody. 

The UCP3 nucleic acids (DNA, RNA) and protein can be 
20 used in a variety of ways. For example, UCP3 nucleic acids 
and proteins can be used to identify agents (e.g., 
molecules) that alter or modulate (enhance, inhibit) UCP3 
expression and/or function. For example, UCP3 can be 
expressed in a host cell and effects of test compounds on 
25 mitochondrial membrane potential in the host cell could be 
assessed. In addition, evaluation of mitochondrial 
respiration could also be performed in the host cell. 

In one embodiment, the present invention relates to a 
method of identifying an agent which alters UCP3 activity, 
30 wherein a nucleic acid construct comprising nucleic acid 
which encodes a mammalian. UCP3 is introduced into, a host 
cell (s) . The host cells produced are maintained under 
conditions appropriate for expression of the encoded 
mammalian UCP3 , whereby the nucleic acid is expressed. The 
3 5 host cells are then contacted with a compound to be 
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assessed (an agent) and the mitochondrial electrical 
potential (mitochondrial membrane potential) of the cells 
is detected in the presence of the compound to be assessed. 
Detection of a change in mitochondrial electrical potential 
5 in the presence of the agent indicates that the agent 
alters UCP3 activity. In a particular embodiment, the 
invention relates to a method of identifying an agent which 
is an activator of UCP3 activity wherein a nucleic acid 
construct comprising nucleic acid which encodes a mammalian 

10 UCP3 is introduced into a host cell (s) . The host cells 
produced are maintained under conditions appropriate for 
expression of the encoded mammalian UCP3 , whereby the 
nucleic acid is expressed. The host cells are then 
contacted with a compound to be assessed (an agent) and the 

15 mitochondrial electrical potential of the cells is detected 
in the presence of the compound to be assessed. Detection 
of a decrease or reduction of mitochondrial electrical 
potential in the presence of the agent indicates that the 
agent activates UCP3 activity. In another embodiment, the 

2 0 invention relates to a method of identifying an agent which 
is an inhibitor of UCP3 activity, wherein a nucleic acid 
construct comprising nucleic acid which encodes a mammalian 
UCP3 is introduced into a host cell(s). The host cells 
produced are maintained under conditions appropriate for 

25 expression of the encoded mammalian UCP3 , whereby the 
nucleic acid is expressed. The host cells are then 
contacted with a compound to be assessed (an agent) and the 
mitochondrial electrical potential of the cells is detected 
in the presence of the compound to be assessed. Detection 

30 of an increase of mitochondrial electrical potential in the 
presence of the agent indicates that the agent irtfiibits 
UCP3 activity. 

Detection of a change in mitochondrial electrical 
potential can be performed using a variety of techniques. 
35 For example, a change in mitochondrial electrical potential 



I 



WO 98/45438 PCT/US98/06959 

-24- 

can be detected by measuring fluorescence of recombinant 
cells expressing UCP3 . Decrease of fluorescence in the 
presence of the test compound, indicates a decrease of 
mitochondrial membrane potential (mitochondrial A40 , and 
5 vice versa for cases where fluorescence is increased. That 
is, increase of fluorescence in the presence of the test 
compound indicates an increase of mitochondrial Air. If 
decrease in fluorescence is observed in UCP3 expressing 
cells, but not in control cells, then the test compound is 
10 an activator of UCP3 . If an increase in fluorescence is 
observed in UCP3 expressing, cells, but not in control 
cells, then the test compound is an inhibitor of UCP3 . 

In a" particular embodiment, as described in Example 3, 
a high throughput screen can be used to identify agents 
15 that activate (enhance) or inhibit UCP3 activity. For 

example, the method of identifying an agent which alters 
UCP3 activity can be performed as follows. A nucleic acid 
construct comprising nucleic acid which encodes a mammalian 
UCP3 is introduced into a host cell(s) to produce 
20 recombinant host cells. The recombinant host cells 

produced are maintained under conditions appropriate for 
expression of the encoded, mammalian UCP3 , whereby the 
nucleic acid is expressed. A fluorescent dye and the 
compound to be assessed are added to the recombinant host 
25 cells; the resulting combination is referred to as a test 
sample. Fluorescence is detected. A decrease of 
fluorescence in the presence of the test compound occurs 
with a decrease in the mitochondrial electrical potential 
of the cells, which indicates that the agent is an 
30 activator of UCP3 . Conversely, an increase of fluorescence 
in the presence of the test compound occurs with ,an 
increase in the mitochondrial electrical potential of the 
cells, which indicates that the agent is an inhibitor of 
UCP3 . -Suitable dyes for use in this embodiment of the 
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invention include, for example, JC-1, rhodamine 123, 
DiOCc[3], or tetramethylhydrosamine . 

A control can be used in the methods of detecting 
agents which alter UCP3 activity. For example, the control 
5 sample includes the same reagents but lacks the compound or 
agent being assessed; it is treated in the same manner as 
the test sample. 

Also encompassed by the present invention is an agent 
which interacts with UCP3 directly or indirectly, and 
10 inhibits or enhances UCP3 expression and/or function. In 
one embodiment, the agent is an inhibitor which interferes 
with UCP3 directly (e.g., by binding UCP3) or indirectly 
(e.g., by blocking the ability of UCP3 to function in 
thermogenesis) . In a particular embodiment, an inhibitor 
15 of UCP3 protein is an antibody specific for UCP3 protein or 
a functional portion of UCP3 ; that is, the antibody binds 
the UCP3 protein. For example, the antibody can be 
specific for the protein encoded by the amino acid sequence 
of human UCP3 (SEQ ID NO: 3), human UCP3sh (SEQ ID NO: 4), 
20 mouse UCP3 (SEQ ID NO: 8) or portions thereof. 

Alternatively, the inhibitor can be an agent other than an 
antibody (e.g., small organic molecule, protein or peptide) 
which binds UCP3 and blocks its activity. For example, the 
inhibitor can be an agent which mimics UCP3 structurally, 
25 but lacks its function. Alternatively, it can be an agent 
which binds to or interacts with a molecule which UCP3 
normally binds with or interacts with, thus blocking UCP3 
from doing so and preventing it from exerting the effects 
it would normally exert. 
3 0 In another embodiment, the agent is an enhancer 

(activator) of UCP3 which increases the activity .of UCP3 
(increases the effect of a given amount or level of UCP3) , 
increases the length of time it is effective (by preventing 
its degradation or otherwise prolonging the time during 
35 which it is active) or both either directly or indirectly. 
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For example, UCP3 nucleic acids and proteins can be used to 
identify anti- obesity drugs which enhance UCP3 to induce 
uncoupling in brown fat and/or skeletal muscle, with 1 the 
result that stored energy is released as heat. 
5 In another embodiment,, the sequences described herein 

can be used to detect UCP3 or DNA encoding UCP3 in a 
sample. For example, a labeled nucleic acid probe having 
all or a functional portion of the nucleotide sequence of 
UCP3 can be used in a method to detect UCP3 in a sample. 

10 In one embodiment, the sample is treated to render the 

nucleic acids in the sample available for hybridization to 
a nucleic acid probe, which can be DNA or RNA. The 
resulting treated sample is combined with a labeled nucleic 
acid probe having all or a portion of the nucleotide 

15- sequence of UCP3 , under conditions appropriate for 
hybridization of complementary sequences to occur. 
Detection of hybridization of nucleic acids, from the sample 
with the labeled nucleic probe indicates the presence of 
UCP3 in a sample. The presence of UCP3 mRNA is indicative 

2 0 of UCP3 expression. Such a method can be used, for 

example, as a screen for normal or abnormal thermogenesis 
in skeletal muscle or brown adipose tissue. 

Alternatively, a method of detecting UCP3 in a sample 
can be accomplished using an antibody directed against UCP3 
25 or a portion of UCP3 . Detection of specific binding to the 
antibody indicates the presence of UCP3 in the sample 
(e.g., ELISA) . This could reflect a pathological state 
associated with UCP3 and, thus, can be used diagnostically . 
The sample for use in the methods of the present 

3 0 invention includes a suitable sample from, for example, a 

mammal, particularly a human. For example, the sample can 
be blood, skeletal muscle or brown adipose tissue. 

The UCP3 sequences of the present invention can also 
be used to generate nonhuman gene knockout animals, such as 
35 mice, which lack UCP3 and transgenically overexpress UCP3 . 
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For example, such UCP3 gene knockout mice can be generated 
and used to obtain further insight into the function of 
UCP3 as well as assess the specificity of UCP3 activators 
and inhibitors. Also, overexpression of UCP3 (e.g., human 
5 UCP3) in transgenic mice can be used as a means of creating 
a test system for UCP3 activators and inhibitors (e.g., 
against human UCP3) . In addition, the UCP3 gene can be 
used to clone the UCP3 promoter/enhancer in order to 
identify regulators of UCP3 transcription. UCP3 gene 
10 knockout animals include animals which completely or 
partially lack the UCP3 gene and/or UCP3 activity or 
function. 

As described herein, it is likely that UCP3 ;plays a 
role in controlling protein- wasting and production of 

15 gluconeogenic precursors by skeletal muscle via transport 

of one or more metabolites, which indicates that inhibitors 
of UCP3 can be. used as a means of curtailing muscle wasting 
due to, for example, infection, (e.g., human, 
immunodeficiency virus) cancer, tumor cachexia, muscle 

20 diseases (e.g., muscular dystrophy) or as a possible 
treatment for non- insulin dependent diabetes mellitus 
(NIDDM) . 

Thus the present invention relates to a method of 
inhibiting (partially, completely) protein catabolism in a 

25 mammal (e.g., human) comprising administering to the mammal 
an effective amount of an inhibitor of UCP3 . The invention 
also relates to a method of enhancing protein catabolism in 
a mammal comprising administering to the mammal an 
effective amount of an enhancer UCP3 . Also encompassed by 

3 0 the present invention is a method of inhibiting muscle 

wasting in a mammal comprising administering an effective 
amount of an enhancer of UCP3 to the mammal . 

A number of studies have demonstrated that brown 
adipose tissue plays an important role in regulating energy 

35 balance in rodents (Himms-Hagen, J . , Prog . Lipid Res., 
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28:67-115 (1989)). The tissue is highly specialized for 
stimulated energy expenditure with a rich vascular supply, 
dense sympathetic innervation, and numerous mitochondria. 
Importantly, brown adipocytes are further distinguished 
5 from other cell types by their expression of all three 

uncoupling proteins: UCP1, which is expressed exclusively 
in brown adipocytes, UCP2 , which is expressed widely 
(Fleury, C, etal., Nature Genetics, 15:269-272 (1997); 
Gimeno, R.E., et al - , Diabetes, in press (1997)) and, as 
10 demonstrated herein, UCP3 which is expressed selectively 
and abundantly in brown adipocytes and skeletal muscle. 
These features make brown fat ideally suited to regulated 
thermogenesis . ! • 

In contrast to rodents, brown adipose tissue in large 
15 mammals is relatively limited and therefore brown fat may 
not be a significant regulator of human energy expenditure. 
A number of studies in humans have implicated skeletal 
muscle as an important mediator of adaptive thermogenesis 
in humans-' (Astrup, A., et al . , Am. J. Physiol., 248:E507- 
20 515 (1985); Astrup, A., et al., Am. J. Physiol., 257:E340- 
345 (1989); Zurlo, F., et al . , J . Clin. Invest., 86:1423- 
1427 . (1990) ; Simonsen, L. , et al . , Am. J . Physiol., 
263:E850-855 (1992); Spraul , M. , etal., J. Clin. Invest., 
92:1730-1735 (1993)). Approximately 80% of the variance in 

2 5 resting energy expenditure between individuals can be 

accounted for by differences in fat-free mass (Ravussin, 
E., et al., Am. J. Clin. Nutr., 55:242S-245S (1992)), much 
of which is skeletal muscle. Similarly, a perfused forearm 
study has demonstrated that differences in skeletal muscle 

3 0 energy expenditure account for much of the variation in 

metabolic rate observed between individuals (Zurlo, F., et 
al., J. Clin. Invest., 86 :-1423 - 1427 (1990)). Regulated 
energy expenditure in skeletal muscle is controlled, in 
large part, by sympathetic stimulation ((Astrup, A., et 
35 al., Am. J. Physiol., 248:E507-515 (1985); Astrup, A., et 



\VO 98/45438 PCT/US98/06959 

-29- 

al., Am. J. Physiol., 257:E340-345 (1989); Simonsen, L . , et 
al., Am. J. Physiol., 263 :E850-855 (1992); Spraul , M . , et 
al., J. Clin. Invest., 52:1730-1735 (1993)). It is 
interesting to note that brown fat and skeletal muscle have 
5 many features in common: a rich blood supply, a dense 
sympathetic innervation, and abundant mitochondria. In 
addition, both tissues express high levels of UCP3 mRNA. 

The heart continuously expends large amounts of energy 
in order to maintain blood circulation. In view of this, 

10 it is probably significant that UCP3 is minimally expressed 
in cardiac tissue. This is especially true given the 
general tendency for non-contractile muscle-specific genes 
to be expressed in both striated muscle types (skeletal and 
cardiac) . Abundant expression of UCP3 in two thermogenic 

15 tissues, skeletal muscle and brown fat, and relative lack 
of expression in other sites such as the heart, 
demonstrates that UCP3 is an important molecular mediator 
of adaptive thermogenesis . 

Thus, the present invention provides for anti-obesity 

2 0 drug development wherein the UCP3 nucleic acids and protein 
can be used to identify, for example, enhancers 
(activators) of UCP3 which can be used to induce 
uncoupling. /? 3 -adrenergic receptor agonists, which 
increase UCP1 expression and activity in brown fat are 

25 presently under development, but may have limited effects 
given the paucity of brown fat in humans. UCP2 is another 
potential target. However, it is expressed in a number of 
critical organs and tissues and its activation could 
produce unwanted side effects. Specific activators of UCP3 

30 expression and/or function, on the other hand, selectively 
increase energy expenditure in skeletal muscle and brown 
t fat, two tissues that have the capacity for adaptive energy 
expenditure. 



WO 98/45438 PCT/US98/06959 



-30- 

The present invention is further illustrated by the 
following examples, which are not intended to be limiting 
in any way. 

EXAMPLE 1 CLONING AND CHARACTER I ZAT I ON OF THE UCP3 GENE 

5 Northern Blot Assays 

Human Multiple Tissue Northern Blots (#7760-1, #7759-1 
and If 77 67-1) containing approximately 2 fig of polyA RNA per 
lane were purchased from Clontech Laboratories (Palo Alto, 
CA) . All hybridizations, membranes washes and membrane 
10 strippings were performed • according to manufacturer's 

specifications. The blots were first hybridized to a hUCP3 
probe, washed and exposed to film for 1-18 hours, then 
stripped, rehybridized to a hUCP2 probe and exposed to film 
for 18 hours. The hUCP3 probe was a 293 bp fragment 
15 corresponding to residues #211-308. The hUCP2 probe was a 
1125 bp fragment spanning the entire open reading frame. 
The specific activities of both hybridization probes were 
similar. 'Mouse Northern blots were generated using total 
RNA isolated from a number of tissues and equal loading of 

20 lanes was established using ethidium bromide florescence. 
The mouse Northern blots were hybridized using the hUCP3 
probe described above . 

RNase Protection 

Total RNA was extracted from adipose tissue the method 

25 of Chomczynski and Sacchi (Chomczynski, P. and Sacchi, N. , 
Anal. Biochem., 162:156-159 (1987)). Skeletal muscle and 
heart RNA was obtained from Clontech. Aliquot s of 1, 3,5 
and 10 fig of adipose tissue and skeletal muscle I?NA and 10 
/ig aliquot of heart RNA were used for determination of UCP3 

30 and mRNA levels. The Rnase protection assay was performed 
as previously described (Vidal-Puig, A., et al . , J . Clin. 
Invest., 57:2553-2561 (1997)). A UCP-3 cDNA fragment was 
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generated by reverse transcriptase- PCR using total RJSTA from 
human muscle as follows: two primers (5'GGA CTA CCA CCT GCT 
CAC TG 3' (SEQ ID NO: 23) and 5' CCC GTA ACA TAT GGA CTT 
T3 ' (SEQ ID NO: 24)) were designed to amplify 302 bp of the 
5 hUCP-3 sequence corresponding to residues #209-308. The 
PCR product was subcloned into PGMT easy TA cloning vector 
(Promega Corp., Madison, WI) and linearized for riboprobe 
synthesis using Spe I. Identity and orientation of the 
UCP3 probe was confirmed by sequencing. The antisense 
10 [ 32 P] -labeled UCP3 template was synthesized using T& RNA 
polymerase. A human cyclin riboprobe was used as an 
internal control (Ambion, Inc., Austin, TX) . 

Results 

As described herein, a third uncoupling homologue 

15 designated UCP3 has been cloned. It is distinguished from 
UCP1 and UCP2 by its selective expression in skeletal 
muscle and brown adipose tissue, two important sites for 
regulated energy expenditure in humans (Astrup, A., et al., 
Am. J. Physiol., 248:E507-515 (1985); Astrup, A., et al . , 

20 Am. J. Physiol., 257:E340-345 (1989); Zurlo, F., et al . , J. 
Clin. Invest.., 36:1423-1427 (1990); Simonsen, L., et al . , 
Am. J. Physiol., 263 :E850-855 (1992); Spraul , M . , et al., 
J. Clin. Invest:., 92:1730-1735 (1993)) and rodents (Himms- 
Hagen, J., Prog. Lipid Res., 25:67-115 (1989)). At the 

25 amino acid level, hUCP3 is 71% identical to hUCP2 and 57% 
identical to hUCPl . Because UCP3 is abundantly and 
selectively expressed in skeletal muscle and brown adipose 
tissue, UCP3 is likely to be an important mediator of 
regulated thermogenesis in humans . Since UCP3 is minimally 

3 0 expressed in heart and other critical organs, it .is a 

.promising target for anti-obesity drug development aimed at 
increasing thermogenesis. 
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The expressed sequence tag (EST) database 
(http://www.ncbi.nlm.gov) was screened for sequences 
homologous to UCP1 . One human EST , deposited by the* 
Washington University, St. Louis - Merck & Co. EST project, 
was identified which was similar but not identical to hUCPl 
and hUCP2 (accession no. AA192136, IMAGE clone no. 628529). 
This clone originated from a human skeletal muscle cDNA 
library (#937209, Stratagene, La Jolla, CA) . The bacterial 
stock for clone 628529 was obtained from Genome Systems 
(St. Louis, MI) and was found to contain an insert of 
approximately 1.3kb, which included the C-terminal third of 
the open reading frame. The coding region within clone 
628529 was fully resequenced. Full-length cDNA sequences 
were generated using the Marathon cDNA Amplification Kit, 
15 human skeletal muscle Marathon-Ready cDNA (both from 
Clontech Laboratories, Palo Alto, CA) and an antisense 
primer (5'-TTC ACC ACG TCG ACC CGG GGG GAT GCC ACC-3') (SEQ 
ID NO: 25) corresponding to the coding sequence presumed to 
represent. hUCP3 . 

UCP3 cDNA sequence contains a 5' untranslated region 
of at least 183 bases, an open reading from of 936 bases, a 
3' untranslated region of approximately 1.1 kb, a 
polyadenylation signal and a polyA tail (Figures 1A-1C) . 
The UCP3 .mRNA transcript is predicted to be equal to or 
25 greater than 2.2 kb. UCP3 protein, as deduced from the 

open reading frame, is composed of 312 amino acids and is 
estimated to have a molecular weight of approximately 34 kD 
(Figure 3). As shown in Figure 3, at the amino acid level, 
hUCP3 is 71% identical to hUCP2 and 57% identical to hUCPl; 
30 and hUCP2 is 59% identical to hUCPl . Many, of the 
nonidentical residues in hUCP3 are conservative ( 
substitutions which in most cases correspond to residues 
found in either mUCP2 (Fleury, C., et al., Nature Genetics, 
15:269-272 (1997); Gimeno, R.E., et al. 7 Diabetes, 46:900- 
35 906 (1997)) or in UCP1 from various species (Klaus, S., et 
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al-, Jut. J. Biochem., 23:791-801 (1991)). The data, based 
upon the high degree of homology between UCP1, UCP2 and 
UCP3 , demonstrates that UCP3 uncouples mitochondrial 
respiration. 

5 In order to establish the tissue distribution of UCP3 

in humans, Northern blot analyses were performed. UCP3 was 
abundantly expressed in skeletal muscle, generating a 
dominant mRNA transcript of approximately 2.4 kb. With 
longer exposure (18 hours), a much weaker UCP3 signal (2.4 
10 kb) was detected in a large number of other tissues and 

organs. The longer exposures (18 hours) of the human UCP3 
Northern blots also revealed the presence of a smaller mRNA 
transcript which had a similar size (approximately 1.6 kb) . 
Of note, the 294 bp hUCP3 probe employed was 75% identical 
15 to hUCP2. Rehybridization of the same blots with hUCP2 
confirmed that this smaller 1.6 kb signal was UCP2 . The 
UCP2 signal, as previously reported (Fleury, C, et al . , 
Nature Genetics, 15:269-272 (1997); Gimeno, R.E., et al . , 
Diabetes, 446:900-906 (1997)) was widely expressed. It was 
2 0 being most abundant in spleen, thymus, bone marrow, 

trachea, and lymph node, and somewhat less abundant in 
skeletal muscle as well as a number of other tissues. UCP2 
was also abundantly expressed in white adipose tissue as 
reported Gimeno, R.E., etal., Diabetes, 446:900-906 
25 (1997)). A comparison of hybridization signals for UCP2 

and UCP3 suggests that UCP3 may be the dominant uncoupling 
protein transcript in human skeletal muscle. 

A sensitive RNase protection assay was used to assess 
UCP3 mRNA expression in heart, skeletal muscle and white . 
30 adipose tissue. No UCP3 signal could be detected in white 
adipose tissue. In heart, a very weak UCP3 signal was 
> detected. The signal in heart was less than 1% of that 
detected in skeletal muscle. 

In mice, abundant UCP3 expression was detected in 
35 skeletal muscle and brown fat . As with humans, little or 
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no UCP3 expression was detected in other mouse tissues such 
as white adipose tissue, brain, kidney, liver and colon. 
As was observed in the human mRNA studies, a smaller' 
transcript was detected in mouse samples as well. This 
5 smaller transcript most likely represents mUCP2 given that 
it was most abundant in white adipose tissue, a site of 
high-level UCP2 expression t (Fleury, C, et al . , Nature 
Genetics, 15:269-272 (1997); Gimeno, R.E., et al . , 
Diabetes, in press (1997)). Of note, the hUCP3 probe is 
10 73% identical to mUCP2 . 

Figure 4 is a hydrophilicity plot of human UCP2 and 
human UCP3 showing the hydrophobicity of protein across 
linear sequence. 

EXAMPLE 2 Discovery of an Alternative Form of UCP3 , 
15 Designated UCP3-short form (UCP3sh) 

As discussed above, the genomic organization of the 
human UCP3 gene has been defined. In addition, it has been 
determined that the UCP3 gene generates two mRNA 
transcripts, UCP3 and UCP3- short form (UCP3sh) . The 

2 0 nucleotide sequence of UCP3sh mRNA is shown in Figures 2 A- 

*i *■ ■ 

2B. The UCP3sh transcript encodes a shortened version of 
the UCP3 protein. As shown in Figure 8, the UCP3sh 
transcript results when a polyadenylation/transcription 
termination signal (AATAAA) (SEQ ID NO: 26) located within 

25 intron 6 terminates transcription (see Figure 3) . However, 
this AATAAA (SEQ ID NO: 26) seems to be only partially 
effective in terminating transcription. When it does 
succeed in terminating transcription, the UCP3sh transcript 
is generated. When it fails to terminate transcription, 

3 0 transcription continues on through exon 7 and terminates at 
the exon 7 AATAAA (SEQ ID NO: 26) signal. Splicing between 
exon 6 'and exon 7 then occurs to generate the UCP3 
transcript. . 
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As shown is Figure 3, UCP3sh differs from UCP3 only by 
the absence of the last 37 amino acids. It is reasonable 
to expect that this is significant, since the region 
missing in UCP3sh is highly homologous to a region in UCP1 
5 which has been implicated in mediating inhibition of 

uncoupling activity by purine nucleotides (Murdza- Inglis , 
D.L., et al., J Biol Chem. 263:7435-7438 (1994)), .As a 
result, it is reasonable to expect that UCP3sh is more 
active as an uncoupler than UCP3 . 

10 Using a quantitative RNase protection assay similar to 

that described in Example 1, it was determined that UCP3sh 
mRNA, like UCP3 mRNA is extremely abundant in human 
skeletal muscle. In normal individuals, the levql of 
UCP3sh mRNA is skeletal muscle is equal to or greater that 

15 the level of UCP3 mRNA. Preliminary studies have indicated 
that UCP3sh mRNA levels are reduced in obese individuals 
compared to lean individual. In contrast, UCP3 mRNA levels 
seem to be unchanged in obese individuals. These 
preliminary findings raise the possibility* that UCP3sh is 

2 0 the more important UCP3 protein for body weight regulation. 

EXAMPLE 3 Cloning of mouse UCP3 gene 

Using the human UCP3 gene, the mouse UCP3 gene was 
isolated using methods similar to those described in 
Example 1. The mouse UCP3 nucleotide sequence (SEQ ID NO: 
25 7) is shown in Figures 5A-5C, and the mouse UPC3 amino acid 
sequence is shown in Figure 6 . Comparisions of mUCP3 
versus mUCPl and mUP2 and human UCP3 are shown in Figure 7. 

EXAMPLE 4 Monitoring of JC-1 fluorescence in living cells 

An assay which utilizes fibroblast -like cells lines 
30 expressing - recombinant human UCP3 , and a fluorescent dye 
(e.g., JC-1) makes it possible to rapidly . assess 
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mitochondrial membrane potential (A^) in living cells 
(Smiley, S.T., et al . , Proc. JVatl. Acad. Sci . USA, 88:3671- 
3675 (1991); Reers , M . , et'al., Methods in Enzymology, 
250:406-417 (1995)). Any drug which increases UCP3 
5 activity is expected to reduce At, and therefore, reduce 
"red" -fluorescence of JC-1. By comparing effects of test 
compounds on fluorescence in a cell line expressing UCP3 
with a control (e.g., cells which do not express UCP3 ; 
cells which express UCP3 in the absence of the test 

10 compound), it is possible to identify specific activators 
and inhibitors of UCP3 . The cells can be grown in 96 well 
plates, and the plates can be read directly in a 
fluorometer designed to handle 96 well plates, it is 
possible to perform this assay in a high- throughput 

15 fashion. 

Recombinant cells expressing hUCP3 and cells not 
expressing UCP3 are grown in 96 well plates. On the day of 
analysis, the plates are rinsed and JC-1 dye is added to 
all wells plus or minus test compounds. Later, plates are 

2 0 washed and then, in the presence of the test compound, 

fluorescence is determined in a fluorometer. Decrease of 
fluorescence in the presence of the test compound, 
indicates a decrease of mitochondrial AJ' (and vice versa 
for cases where fluorescence is increased) . That is, 
25 increase of fluorescence in the presence of the test 

compound indicates an increase of mitochondrial Mr. If 
decrease in fluorescence t l is observed in UCP3 expressing 
cells but not in control cells, then the test compound is 
an activator of UCP3 . If an increase in fluorescence is 

3 0 observed in UCP3 expressing cells, but not in control 

cells, then the test compound is an inhibitor of ,UCP3 . 

Any dye can be used in the high- throughout screen, 
such as JC-1, rhodamine 123, DiOCc[3], or 

tetramethylhydrosamine . In a particular embodiment, JC-1 
35 dye, a delocalized lipophilic cation (DLC) , can be used. 
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The distinguishing feature of DLCs is that they are 
positively charged, yet lipophilic. The lipophilic feature 
allows then to traverse membranes and the positive charge 
causes then to accumulate within mitochondria (negatively 
5 charged on the inside) . This accumulation is proportional 
to A X I', the membrane electrical potential across the inner 
mitochondrial membrane, and follows the Nernst Equation 
shown below. The mitochondrial Mr results from the protein 
electrochemical gradient across the inner mitochondrial 
10 membrane and represents the electrical portion of this 
gradient (ApH represents the chemical portion of the 
gradient) . 

Mr = -60 log F in /F oul F = concentration of DLC 

Thus, a At of -60 mV corresponds to a DLC in/out ratio of 
15 10 to 1, and a A^ of -120 mV, corresponds to a DLC in/out 
ratio of 100 to 1. Thus, a change in A* is amplified by a 
change in F in /F oul . Of note, Mr for most mitochondrial range 
between -50 mV and -160 mV . 

Protonophore uncouplers such as DNP (dinitrophenol ) , 
20 CCCP ■ (carbonyl cyanide m-chlorophenyllhydrazone) , decrease 
Mr and, as a result, markedly decrease the accumulation of 
JC-1. Any drug which increases UCP activity is expected to 
have the same effect as DNP, CCCP or FCCP. 

JC-1 has fluorescent features which makes it extremely 
2 5 useful as a monitor of mitochondrial Mr. Many dyes 
aggregate at high concentrations and this reduces 
fluorescence greatly (for example, rhodamine 123) . 
Aggregates of JC-1 fluoresce intensely, and at higher 
wavelength than JC-1 monomers. Specifically, monomers emit 
30 * at 527 nM (green) while J-aggregates emit at. 590 nM (red). 
Thus, high concentrations of JC-1 accumulate in 
mitochondria permitting the formation of aggregates. The 
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accumulation of JC-1 and therefore the formation of 
aggregates is proportional to mitochondrial Mr. Aggregates 
do not form in other cellular locations due to insufficient 
accumulation of JC-1. Thus, detection of aggregates (as 
5 measured by fluorescence at 590 nM) is a sensitive 
indicator of mitochondrial Mf . 

CX-1 cells were incubated with JC-1 (lOug/ml) with or 
without the uncoupler, FCCP, for 10 minutes, washed 3 
timei?, trypsinized and then transferred as a cell 
10 suspension to a 1 cm quartz cuvette, in which fluorescence 
was monitored using a Kontron SFM25 fluorescent 
spectrophotometer . 



Fluorescence (in arbitrary 
520 nM 
(green) 


units) 


590 nM 
(red) 




CX-1 cells 


90 


90 


CX-1 cells + FCCP 


80 . 


10 



The data shows that JC-1 aggregate fluorescence can be 
monitored' in living cells and that an uncoupler (FCCP) 
which is "expected to have the same effect as a UCP 
activator markedly lowers "red" fluorescence. Fluorescence 
5 can also be monitored using a FACScan flow cytometer or in 
a single cell using fluorescence microscopy. 

EXAMPLE 5 UCP3 GENE EXPRESSION: Tissue Distribution and 
Physiologic Regulation 

Tissue Distribution - In humans, UCP3 is expressed 
10 abundantly and preferentially in skeletal muscle. In rats, 
UCP3 is expressed abundantly in skeletal muscle and brown 
fat. 

Starvation - UCP 3 was dramatically increased by 
starvation in mice and rats (-5-10 fold) . In humans, it 
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has been shown that 5 days of food restriction causes a 
2. 5- fold increase in UCP3 mRNA expression. Also, it was 
found that human UCP3 mRNA is significantly upregulated 
when transgenic mice bearing a human UCP3 PI clone are 
5 starved. Thus, it is likely that humans, like rodents, 
increase UCP3 gene expression with starvation. 

Role of FFAs - Recently, it was shown that treatment 
of fed rats with Intralipid plus heparin (which produced an 
increase in free fatty acids (FFAs) from 0.26 to 2.04 mM) 

10 caused a 3-fold increase in UCP3 (Weigle D.S., Diabetics, 
47:298-302 (1998)). Based upon this observation, it was 
suggested that the increase in FFAs with starvation was 
responsible for the effects of starvation on UCP3I mRNA 
levels. It was speculated that "this induction of UCP3 may 

15 be linked to the utilization of free fatty acids as a 

fuel". As discussed below however, it is unlikely that 
this hypothesis is true. 

Starvation plus Nicotinic Acid - 1 day fasted rats 
were treated with saline or nicotinic acid for 6 hours and 

20 the effects on UCP3 gene expression were assessed. 

Starvation increases lipolysis in adipose tissue, causing a 
marked increase in blood levels of FFAs. The increase in 
FFAs is thought to promote conservation of protein in 
skeletal muscle (when lipid fuels are abundant, the 

2 5 requirement for gluconeogenesis from muscle protein is 

reduced) . Nicotinic acid inhibits lipolysis, restores FFA 
levels to fed values, and . stimulates protein catabolism in 
skeletal muscle (Lowell and Goodman, Diabetics, 35:14-19 
(1987) ) . The experiment described herein shows that 

3 0 nicotinic acid treatment of fasted animals returned FFA 

. levels to fed values, but increased UCP3 mRNA to .levels 2- 
. fold higher than those observed in saline treated fasted 
controls. This observation shows that the starvation- 
induced rise in FFAs is not responsible for the effects of 
35 starvation on UCP3 mRNA levels. Also, it shows that UCP3 
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is not linked to the utilization of FFAs as fuel. Instead, 
based upon this finding it is reasonable to expect that 
UCP3 is linked to protein catabolism in skeletal muscle. 
Streptozotocin Diabetes - Fourteen days of 
5 streptozotocin diabetes in rats produced a very large 
increase in UCP3 mRNA levels. This rise in UCP3 was 
reversed with one day of insulin treatment. Streptozotocin 
diabetes is associated with significant protein catabolism 
in skeletal muscle. . . 

10 Endotoxin - Endotoxin treatment of rats and mice 

resulted in a very large increase in UCP3 mRNA levels in 
skeletal muscle, but not in other tissues. Endotoxin is a 
well known stimulator of protein catabolism in skeletal 
muscle. 

15 Dexamethasone - High dose dexamethasone treatment 

markedly stimulated UCP3 mRNA levels in skeletal muscle, 
but not in other tissues. Dexamethasone is also a well 
known stimulator of protein catabolism in skeletal muscle. 
Thyroid Hormone - High dose thyroid treatment in rats 

20 stimulated UCP3 mRNA levels. Thyroid hormones seemed to 

have little or no effect, in mice. Thyroid hormone is also 
a well known stimulator of protein catabolism in skeletal 
muscle. 

ob/ob and db/db mice: fa/fa rats - These genetically 
25 obese rodents were generated and shown to have markedly 
increased UCP3 mRNA levels in skeletal muscle. It is 
likely that increased UCP3 mRNA levels in ob/ob mice 
contributed to elevated production of gluconeogenic 
precursors by muscle, thereby promoting non- insulin 
30 dependent diabetes mellitus (NIDDM) in these animals. 

It is interesting to note that nearly all pqsitive 
regulators of UCP3 gene expression (starvation, nicotinic 
acid treatment during starvation, streptozotocin diabetes, 
endotoxin, dexamethasone and thyroid hormone) are 
35 associated with catabolism of skeletal muscle protein (see 
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Mitch and Goldberg, NEJM, 335:1897-1905 (1996.)). The only 
exceptions to this are genetically obese rodents (however, 
these animals do have decreased muscle mass) . From another 
perspective, it is also true that all catabolic states 
5 tested to date are associated with increased UCP3 
expression. 

Given that increased UCP3 gene expressions is linked 
to states of augmented skeletal muscle protein catabolism, 
it is likely that UCP3 plays an important role in 
10 regulating skeletal muscle protein catabolism (conversion 
of muscle protein to gluconeogenic precursors) . Possible 
mechanisms by which UCP3 plays a role are the following: 

a) UCP3 is a mitochondrial carrier which transports 
biosynthetic metabolites in and out of mitochondria 

15 during skeletal muscle protein catabolism (i.e., 

conversion of aspartate, glutamate, valine, isoleucine 
and leucine to gluconeogenic precursors alanine and 
glut amine) . . 

b) UCP3 is the aspartate/glutamate carrier and is rate 

20 limiting for operation of the aspartate/malate shuttle 

. (transfers cytosolic NADH into the mitochondria) . 
Increased operation of this shuttle would reduce the 
cytosolic NADH /NAD ratio. It has been suggested that 
the cytosolic NADH /NAD ratio regulates muscle protein 

25 catabolism. 

c) UCP3 is indeed a genuine uncoupling protein and 
increased UCP3 activity in catabolic states oxidizes 
the whole cell redox state (NADH/NAD ratio) , - thereby . 
stimulating protein catabolism and amino acid 

30 metabolism. 

Skeletal Muscle Metabolism. During Starvation (and other 
catabolic states) . 
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During starvation, muscle mobilizes actin and myosin 
protein and releases gluconeogenic precursors into the 
blood (primarily alanine and glutamine) . This resporise is 
critical for survival. In the absence of gluconeogenesis 
5 from muscle protein, blood glucose levels would fall 
during starvation and brain dysfunction would occur. 

The amino acids released from muscle protein are 
significantly metabolized inside the myocytes prior to 
their release into the bloodstream. Alanine and glutamine 
10 represent approximately 12% amino acids in muscle protein 
but together represent > 50% of amino acids released by 
muscle during starvation. Thus, much of the alanine and 
glutamine released must be. ^synthesized. In contrast, 
aspartate, asparginine, glutamate, leucine, isoleucine and 
15 valine represent > 30% of amino acids in muscle protein but 
are released in only small amounts during starvation. These 
amino acids are interconverted to alanine and glutamine by 
muscle. Other amino acids such as glycine, cysteine, 
serine, threonine, methionine, proline, lysine, arginine, 
20 histidine, phenylalanine, tyrosine and tryptophan 

represents approximately 50% of muscle protein and are 
released either unchanged or as deaminated cx-ketoacids . 

Alanine is generated by the transamination of 
pyruvate. The pyruvate (i.e., carbon) for alanine 

2 5 synthesis come from glycolysis while the nitrogen 

originates from aspartate, asparginine, glutamate, leucine, 
isoleucine and valine. The released alanine is taken up by 
the liver and used to synthesize glucose. The glucose is 
then returned to the muscle and is metabolized into 
30 pyruvate, thus completing the glucose-alanine cycle. It is 
important to note that no new glucose is synthesized by 
this process, the carbons are simply recycled. Thus, the 
glucose-alanine cycle functions to conserve carbohydrate, 
but doe's not generate new carbohydrate. The cycle also 

3 5 functions to transfer NH2 from amino acids with are 
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metabolized (aspartate, asparginine, glutamate, leucine, 
isoleucine and valine) to the liver where it can be 
detoxified via the urea cycle. 

Because certain tissues are oxidizing glucose to C0 2 
5 (i.e., the brain), new glucose must be synthesized during 
starvation. This new glucose is synthesized from 
glutamine, which is released by muscle. The carbon 
backbone for glutamine comes from aspartate, asparginine, 
glutamate, isoleucine and valine, while the nitrogen comes 
10 from these same amino acids plus leucine. The leucine 
carbon backbone is completely oxidized to C0 2 by muscle. 
The glutamine released by muscle is taken up by the kidney 
and intestines, where a complex pathway is initiated 
culminating in the synthesis of glucose. Glutamine 
15 synthetase is the enzyme which converts glutamate to 

glutamine, the final step in glutamine synthesis. " It is 
interesting to note that glutamine synthetase gene 
expression in muscle, like UCP3 gene expression, is 
induced by starvation, streptozotocin diabetes, endotoxin 
20 treatment and dexamethasone . It is also interesting to 
note, as was seen with UCP3, that these effects on 
glutamine synthetase gene expression are observed in 
skeletal muscle, but not in other tissues. 

Significant mitochondrial metabolism must occur in 
25 order for aspartate, asparginine, glutamate, isoleucine, 
valine and leucine to be interconverted to alanine and 
glutamate. This is because important enzymes involved in 
the interconversion are located within the mitochondrial 
matrix. One example is branched chain a-ketoacid 
3 0 dehydrogenase (BCKADH) , an enzyme which initiates the 

oxidation of leucine, isoleucine and valine. Of .interest, 
' BCKADH activity in muscle increases significantly during 
starvation and streptozotocin diabetes . Thus, metabolites 
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must flux in and out of mitochondria for muscle to release 
alanine and glutamine during catabolic states. 

EQUIVALENTS 

Those skilled in the art will recognize, or be able to 
5 ascertain using no more than routine experimentation, many- 
equivalents to the specific embodiments of the invention 
described herein. Such equivalents are intended to be 
encompassed by the following claims. 
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(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 122 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

AGGAGGGGCC ATCCAATCCC TGCTGCCACC TCCTGGGATG GAGCCCTAGG GAGCCCCTGT 60 

GCTGCCCCTG CCGTGGCAGG ACTCACAGCC CCACCGCTGC ACTGAAGCCC AGGGCTGTGG 12 0 

AGCAGCCTCT CTC CTTGGAC CTCCTCTGGG CCCTAAAGGG ACTGGG C AG A GCCTTCCAGG 180 

ACTATGGTTG GACTGAAGC C TTCAGACGTC CCTCCCACCA TGGCTGTGAA GTTCCTGGGG 24 0 

GCAGG CACAG CAGCCTGTTT TGCTGAACTC GTTACCTTTC CACTGGACAC AGCCAAGGTC 3 00 

CGCCTGCAGA TCCAGGGGGA GAACCAGGCG GTCCAGACGG CCCGGCTCGT GCAGTAC CGT 3 60 

GGCGTGCTGG GCACCATCCT GACCATGGTG CGGACTGAGG GTCCCTGCAG CCCCTACAAT 420 

GGGCTGGTGG CCGGCCTGCA GCGCCAGATG AGCTTCGCCT CCATCCGCAT CGGCCTCTAT 4 80 

GACTCCGTCA AGCAGGTGTA CACCCCCAAA GGCGCGGACA ACTCCAGCCT CACTACCCGG 540 

ATTTTGGCCG GCTGCACCAC AGGAGCCATG GCGGTGACCT GTGCCCAGCC CACAGATGTG 600 

GTGAAGGTCC GATTTCAGGC CAGCATACAC CTCGGGCCAT C C AGGAGCG A CAGAAAATAC 660 

AGCGGGACTA TGGACGC CT A CAGAACCATC GCCAGGGAGG AAGGAGTCAG GGGCCTGTGG 720 

AAAGGAACTT TGCCCAACAT CATGAGGAAT GCTATCGTCA ACTGTGCTGA GGTGGTG AC C 7 80 

TACGACATCC TCAAGGAGAA GCTGCTGGAC TACCACCTGC TCACTGACAA CTTCCCCTGC 840 

CACTTTGTCT CTGCCTTTGG AGCCGGCTTC TGTGCCACAG TGGTGGCCTC CCCGGTGGAC 900 

GTGGTGAAGA CCCGGTATAT GAACTCACCT CCAGGCCAGT ACTTCAGCCC CCTCGACTGT 960 

ATGATAAAGA TGGTGGCCCA GGAGGGCCCC ACAGCCTTCT ACAAGGGATT TACACCCTCC 102 0 

TTTTTGCGTT TGGGATCCTG GAACGTGGTG ATGTTCGTAA CCTATGAGCA GCTGAAACGG 10 80 

GCCCTGATGA AAGTCCAGAT GTTACGGGAA TCACCGTTTT GAACAAGACA AGAAGGCCAC 114 0 

TGGTAG CTAA CGTGTCCGAA AC C AGTTAAG AATGGAAGAA AACGGTGCAT CCACGCACAC 1200 

ATGGACACAG ACCCACACAT 122 0 
<2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 1034 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = 11 DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 



GAGGGGCCAT 


CCAATCCCTG 


CTGCCACCTC 


CTGGGATGGA 


GCCCTAGGGA 


GCCCCTGTGC 


60 


TGCCCCTGCC 


GTGGCAGGAC 


TCACAGCCCC 


ACCGCTGCCT 


GAAGCCCAGG 


GCTGTGGAGC 


120 


AGCCTCTCTC 


CTTGGACCTC 


CTCTCGGCCC 


TAAAGGGACT 


GGGCAGAGCC 


TTCCAGGACT 


180 


ATGGTTGGAC 


TGAAGCCTTC 


AGACGTGCCT 


CCCACCATGG 


CTGTGAAGTT 


CCTGGGGGCA 


240 


GGCACAGCAG 


CCTGTTTTGC 


TGAACTCGTT 


ACCTTTCCAC 


TGGACACAGC 


CAAGGTCCGC 


300 


CTGCAGATCC 


AG GGGG AGAA 


CCAGGCGGTC 


CAGACGGCCC 


GGCTCGTGCA 


GTACCGTGGC 


360 


GTGCTGGGCA 


CCATCCTGAC 


CATGGTGCGG 


ACTGAGGGTC 


CCTGCAGCCC 


CTACAATGGG 


420 


CTGGTGGCCG 


GCCTGCAGCG 


CCAGATGAGC 


TTCGCCTCCA 


TCCGCATCGG 


C CTCTATGAC 


480 


TCCGTCAAGC 


AGGTGTACAC 


CCCCAAAGGC 


GCGGACAACT 


TCCAGCCTCA 


CTACCCGGAT 


540 


TTTGGCCGGC 


TGCACCACAG 


GAGCCATGGC 


GGTGACCTGT 


GCCCAGCCCA 


CAGATGTGGT 


600 


GAAGGTCCGA 


TTTCAGGCCA 


GCATACACCT 


CGGGCCATCC 


AGGACCGACA 


GAAAATACAG 


660 


CGGGACTATG 


GACGCCTACA 


GAACCATCGC 


CAGGGAGGAA 


GGAGTCAGGG 


GCCTGTGGAA 


.720 


AGGAACTTTG 


CCCAACATCA 


TGAGGAATGC 


TATCGTCAAC 


TGTGCTGAGG 


TGGTGACCTA 


780 


CGACATCCTC 


AAGGAGAAGC 


TGCTGGACTA 


CCACCTGCTC 


ACTGACAACT 


TCCCCTGCCA 


840 


CTTTGTCTCT 


GCCTTTGGAG 


CCGGCTTCTG 


TGC CACAGTG 


GTGGCCTCCC 


CGGTGGACGT 


900 


GGTGAAGACC 


CGGTATATGA 


ACTCACCTCC 


AGGCCAGTAC 


TTCAGCCCCC 


TCGACTGTAT 


960 


GATAAAGATG 


GTGGCCCAGG 


AGGGCCCCAC 


AGCCTTCTAC 


AAGGGGTGAG 


CCTCCTCCTG 


1020 


CCTCCAGCAC 


TCCC 










1034 



(2) INFORMATION FOR SEQ ID NO; 3: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 312 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
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Met Val Gly Leu Lya Pro Ser Asp Val Pro Pro Thr Met Ala Val Lys 
1 5 10 15 

Phe Leu Gly Ala Gly Thr Ala Ala Cys Phe Ala Glu Leu Val Thr Pfce 
20 25 30 

Pro Leu Asp Thr Ala Lys Val Arg Leu Gin lie Gin Gly Glu Asn Gin 
35 40 45 

Ala Val Gin Thr Ala Arg Leu Val Gin Tyr Arg Gly Val Leu Gly Thr 
50 55 60 

lie Leu Thr Met Val Arg Thr Glu Gly Pro Cys Ser Pro Tyr Asn Gly 
65 70 75 80 

Leu Val Ala Gly Leu Gin Arg Gin Met Ser Phe Ala Ser lie Arg lie 
85 90 95 

Gly Leu Tyr Asp Ser Val Lys Gin Val Tyr Thr Pro Lys Gly Ala Asp 
100 105 110 

Asn Ser Ser Leu Thr Thr Arg He Leu Ala Gly Cys Thr Thr Gay Ala 
115 120 125 

Met Ala Val Thr Cys Ala Gin Pro Thr Asp Val Val Lys Val Arg Phe 
130 135 140 

Gin Ala Ser He His Leu Gly Pro Ser Arg Ser Asp Arg Lys Tyr Ser 
145 150 155 160 

Gly Thr Met Asp Ala Tyr Arg Thr He Ala Arg Glu Glu Gly Val Arg 
165 170 175 

Gly Leu Trp Lys Gly Thr Leu Pro Asn He Met Arg Asn Ala He Val 
180 185 190 

Asn Cys Ala Glu Val Val Thr Tyr Asp He Leu Lys Glu Lys Leu Leu 
195 200 205 

Asp Tyr His Leu Leu Thr Asp Asn Phe Pro Cys His Phe Val Ser Ala 
210 215 220 

Phe Gly Ala Gly Phe Cys Ala Thr Val Val Ala Ser Pro Val Asp Val 
225 230 235 240 

Val Lys Thr Arg Tyr Met Asn Ser Pro Pro Gly Gin Tyr Phe Ser Pro 
245 250 255 

Leu Asp Cys Met He Lys Met Val Ala Gin Glu Gly Pro Thr Ala Phe 
260 265 270 

Tyr Lys Gly Phe Thr Pro Ser Phe Leu Arg Leu Gly Ser Trp Asn Val 
275 280 285 

Val Met Phe Val Thr Tyr Glu Gin Leu Lys Arg Ala Leu Met Lys Val 
290 295 300 

Gin Met Leu Arg Glu Ser Pro Phe 
305 310 

(2) INFORMATION FOR SEQ ID NO : 4 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 275 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Val Gly Leu Lys Pro Ser Asp Val Pro Pro Thr Met Ala Val Lys 
1 5 10 15 

Phe Leu Gly Ala Gly Thr Ala Ala Cys Phe Ala Glu Leu Val -Thr Phe 
20 25 30 

Pro Leu Asp Thr Ala Lys Val Arg Leu Gin lie Gin Gly Glu Asn Gin 
35 40 45 

Ala Val Gin Thr Ala Arg Leu Val Gin Tyr Arg Gly Val Leu Gly Thr 
50 55 60 

lie Leu Thr Met Val Arg Thr Glu Gly Pro Cys Ser Pro Tyr Asn Gly 
65 70 75 80 

Leu Val Ala Gly Leu Gin Arg Gin Met Ser Phe Ala Ser lie Arg lie 
85 90 95 

Gly Leu Tyr Asp Ser Val Lys Gin Val Tyr Thr Pro Lys Gly Ala Asp 
100 105 110 

Asn Ser Ser Leu Thr Thr Arg lie Leu Ala Gly Cys Thr Thr Gly Ala 
115 120 125 

Met Ala Val Thr Cys Ala Gin Pro Thr Asp Val Val Lys Val Arg Phe 
130 135 140 

Gin Ala Ser lie His Leu Gly Pro Ser Arg Ser Asp Arg Lys Tyr Ser 
145 150 155 160 

Gly Thr Met Asp Ala Tyr Arg Thr lie Ala Arg Glu Glu Gly Val Arg 
165 170 175 

Gly Leu Trp Lys Gly Thr Leu Pro Asn lie Met Arg Asn Ala lie Val 
180 185 190 

Asn Cys Ala Glu Val Val Thr Tyr Asp lie Leu Lys Glu Lys Leu Leu 
195 200 205 

Asp Tyr His Leu Leu Thr Asp Asn Phe Pro Cys His Phe Val Ser Ala 
210 215 220 

Phe Gly Ala Gly Phe Cys Ala Thr Val Val Ala Ser Pro Val Asp Val 
225 230 235 240 

Val Lys Thr Arg Tyr Met Asn Ser Pro Pro Gly Gin Tyr Phe Ser Pro 
245 250 255 
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Leu Asp Cys Met He Lys Met Val Ala Gin Glu Gly Pro Thr Ala Phe 
260 265 270 

Tyr Lys Gly 
275 

{ 2 ) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 307 amino acids 

(B) TYPE: amino acid 

{ C ) STRANDEDNES S : unknown 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Met Gly Gly Leu Thr Ala Ser Asp Val His Pro Thr Leu Gly Val Gin 
1 5 10 \S 

i 

Leu Phe Ser Ala Gly He Ala Ala Cys Leu Ala Asp Val He Thr Phe 
20 25 30 

Pro Leu Asp Thr Ala Lys Val Arg Leu Gin Val Gin Gly Glu Cys Pro 
35 40 45 

Thr Ser Ser Val lie Arg Tyr Lys Gly Val Leu Gly Thr He Thr Ala 
50 55 60 

Val Val Lys Thr Glu Gly Arg Met Lys Leu Tyr Ser Gly Leu Pro Ala 
65 70 75 80 

Gly Leu Gin Arg Gin He Ser Ser Ala Ser Leu Arg He Gly Leu Tyr 
85 90 95 

Asp Thr Val Gin Glu Phe Leu Thr Ala Gly Lys Glu Thr Ala Pro Ser 
100 105 110 

Leu Gly Ser Lys He Leu Ala Gly Leu Thr Thr Gly Gly Val Ala Val 
115 120 125 

Phe He Gly Gin Pro Thr Glu Val Val Lys Val Arg Leu Gin Ala Gin 
130 135 140 

Ser Hio Leu Hia Gly He Lyo Pro Arg Tyr Thr Gly Thr Tyr Asn Ala 
145 150 155 160 

Tyr Arg He He Ala Thr Thr Glu Gly Leu Thr Gly Leu Trp Lys Gly 
165 170 175 

Thr Thr Pro Asn Leu Met Arg Ser Val He He Asn Cys Thr Glu Leu 
180 185 190 , 

Val Thr Tyr Asp Leu Met Lys Glu Ala Phe Val Lys Asn Asn He Leu 
195 200 205 

Ala Asp ' Asp Val Pro Cys His Leu Val Ser Ala Leu He Ala Gly Phe 
210 215 220 
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Cys Ala Thr Ala Met Ser Ser Pro Val Asp Val Val Lys T,hr Arg Phe 
225 230 235 240 

lie Asn Ser Pro Pro Gly Gin Tyr Lys Ser Val Pro Asn Cys Ala Met 
245 250 255 

Lys Val Phe Thr Asn Glu Gly Pro Thr Ala Phe Phe Lys Gly Leu Val 
260 265 270 

Pro Ser Phe Leu Arg Leu Gly Ser Trp Asn Val lie Met Phe Val Cys 
275 280 285 

Phe Glu Gin Leu Lys Arg Glu Leu Ser Lys Ser Arg Gin Thr Met Asp 
290 295 300 

Cys Ala Thr 
305 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 308 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Met Val Gly Phe Lys Ala Thr Asp Val Pro Pro Thr Ala Thr Val Lys 
15 10 15 

Leu Phe Gly Ala Gly Thr Ala Ala Cys. lie Ala Asp Leu lie Thr Phe 
20 25 30 

Pro Leu Asp Thr Ala Lys Val Arg Leu Gin lie Gin Gly Glu Ser Gin 
35 40 45 

Gly Pro Val Arg Ala Thr Val Ser Ala Gin Tyr Arg Gly Val Met Gly 
50 55 . : 60 

Thr lie Leu Thr Met Val Arg Thr- Glu Gly Pro Arg Ser Leu Tyr Asn 
65 70 " 75 80 

Cys Leu Val Ala Gly Leu Gin Arg Gin Met Ser Phe Ala Ser Val Arg 
85 90 95 

lie Gly Leu Tyr Asp Ser Val Lys Gin Phe Tyr Thr Lys Gly Ser Glu 
100 105 110 

His Ala Ser lie Gly Ser Arg Leu Leu Ala Gly Ser Thr Thr Gly Ala 
115 120 125 

» Leu Ala Val Ala Val Ala Gin Pro Thr Asp Val Val Lys Val Arg Phe 
130 135 140 

Gin Ala Gin Arg Ala Gly Gly Gly Arg Arg Tyr Gin Ser Thr Val Asn 
145 150 155 160 
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Ala Tyr Lye Thr He Ala Arg Glu Glu Gly Phe Arg Gly Leu Trp Lys 
165 ' 170 175 

Gly Thr Ser Pro Asn Val Ala Arg Asn Ala He Val Asn Cys Ala Glu 
180 185 190 

Leu Val Thr Tyr Asp Leu He Lys Asp Ala Leu Leu Lys Ala Asn Leu 
195 200 205 

Met Thr Asp Asp Leu Pro Cys His Phe Thr Ser Ala Phe Gly Ala Gly 
210 215 220 

Phe Cys Thr Thr Val He Ala Ser Pro Val Asp Val Val Lys Thr Arg 
225 230 235 240 

Tyr Met Asn Ser Ala Leu Gly Gin Tyr Ser Ser Ala Gly His Cys Ala 
245 250 255 

Leu Thr Met Leu Gin Lys Glu Gly Pro Arg Ala Phe Tyr Lys Gly Phe 
260 265 270 

Met Pro Ser Phe Leu Arg Leu Gly Ser Trp Asn Val Val Met Phe Val 
275 280 285 

Thr Tyr Glu Gin Leu Lys Arg Ala Leu Met Ala Ala Cys Thr Ser Arg 
290 295 300 

Glu Ala Pro Phe 
305 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1204 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc o "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ, ID NO: 7: 
GAGACAACAG TGAATGGTGA GGCCCGGCCG TCAGATCCTG CTGCTACCTA ATGGAGTGGA 60 

GCCTTAGGGT GGCCCTGCAC TACCCAACCT TGGCTAGACG CACAGCTTCC TCCCTGAACT 12 0 

GAAGCAAAAG ATTGCCAGGC AAGCTCTCTC CTCGGACCTC CATAGGCAGC AAAGGAACCA 180 

GGCCCATTCC CCGGGACCAT GGTTGGACTT CAGCCCTCCG AAGTGCCTCC CACAACGGTT 240 

GTGAAGTTCC TGGGGGCCGG CACTGCGGCC TGTTTTGCGG ACCTCCTCAC TTTTCCCCTG 300 

GACACCGCCA AGGTCCGTCT GCAGATCCAA GGGGAGAACC CAGGGGCTCA GAGCGTGCAG 360 

TACCGCGGTG TGCTGGGTAC CATCCTGACT ATGGTGCGCA CAGAGGGTCC CCGCAGCCCC 420 

TACAGCGGAC TGGTCGCTGG CCTGCACCGC CAGATGAGTT TTGCCTCCAT TCGAATTGGC 4 80 

CTCTACGACT CTGTCAAGCA GTTCTACACC CCCAAGGGAG CGGACCACTC CAGCGTCGCC 540 
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ATCAGGATTC TGGCAGGCTG CACGACAGGA GCCATGGCAG TGACCTGCGC CCAGCCCACG 600 

GATGTGGTCA AGGTCCGATT TCAAGCCATG ATACGCCTGG GAACTGGAGG AGAGAGGAAA 660 

TACAGAGGGA CTATGGATGC CTACAGAACC ATCGCCAGGG AGGAAGGAGT CAGGGGCCTG 720 

TGGAAAGGGA CTTGGCCCAA CATCACAAGA AATGCCATTG TCAACTGTGC TGAGATGGTG 780 

ACCTACGACA TCATCAAGGA GAAGTTGCTG GAGTCTCACC TGTTTACTGA CAACTTCCCC 840 

TGTCACTTTG TCTCTGCCTT TGGAGCTGGC TTCTGTGCCA CAGTGGTGGC CTCCCCGGTC 90 0 

GATGTGGTAA AGACCCGATA CATGAACGCT CCCCTAGGCA GGTACCGCAG CCCTCTGCAC 960 

TGTATGCTGA AGATGGTGGC TCACGAGGGA CCCACGGCCT TCTACAAAGG ATTTGTGCCC 102 0 

TCCTTTCTGC GTCTGGGAGC TTGGAACGTG ATGATGTTTG TAACATATCA GCAACTGAAG 1080 

AGGGCCTTAA TGAAAGTCCA GGTACTGCGG GAATCTCCGT TTTGAACAAG GCAAGCAGGC 114 0 

TGCCTGGAAC AGAACAAAGC GTCTCTGCCT GGGACACAGG CCCACACGTC AGAACCGTGC 12 00 
ACGC 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 308 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Met Val Gly Leu Gin Pro Ser Glu Val Pro Pro Thr Thr Val Val Lys 
1 5 10 15 

Phe Leu Gly Ala Gly Thr Ala Ala Cys Phe Ala Asp Leu Leu Thr Phe 
20 25 30 

Pro Leu Asp Thr Ala Lys Val Arg Leu Gin lie Gin Gly Glu Asn Pro 
35 40. 45 

Gly Ala Gin Ser Val Gin Tyr Arg Gly Val Leu Gly Thr He Leu Thr 
50 55 60 

Met Val Arg Thr Glu Gly Pro Arg Ser Pro Tyr Ser Gly Leu Val Ala 
65 70 75 80 

Gly Leu His Arg Gin Met Ser Phe Ala Ser He Arg He Gly Leu Tyr 
85 90. 95 

Asp Ser Val Lys Gin Phe Tyr Thr Pro Lys Gly Ala Asp His Ser Ser 
100 < 105 HO 

Val Ala He Arg He Leu Ala Gly Cys Thr Thr Gly Ala Met Ala Val 
115 .120 125 
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Thr Cys Ala Gin Pro Thr Asp Val Val Lys Val Arg Phe Gin Ala Met 
130 135 140 

He Arg Leu Gly Thr Gly Gly Glu Arg Lys Tyr Arg Gly Thr Met Asp 
145 150 155 160 

Ala Tyr Arg Thr He Ala Arg Glu Glu Gly Val Arg Gly Leu Trp Lys 
165 170 175 

Glv Thr Trp Pro Asn lie Thr Arg Asn Ala He Val Asn Cys Ala Glu 
y 180 185 190 

Met Val Thr Tyr Asp He He Lys Glu Lys Leu Leu Glu Ser His Leu 
195 200 205 

Phe Thr Asp Asn Phe Pro Cys His Phe Val Ser Ala Phe Gly Ala Gly 
2io 215 220 

Phe Cvs Ala Thr Val Val Ala Ser Pro Val Asp Val Val Lys Thr Arg 
225 230 235 240 

Tvr Met Asn Ala Pro Leu Gly Arg Tyr Arg Ser Pro Leu His Cys Met 
y 245 ' 250 255 

Leu Lys Met Val Ala Gin Glu Gly Pro Thr Ala Phe Tyr Lys Gly Phe 
260 265 270 

Val Pro Ser Phe Leu Arg Leu Gly Ala Trp Asn Val Met Met Phe Val 
275 280 285 

Thr Tyr Glu Gin Leu Lys Arg Ala Leu Met Lys Val Gin Val Leu Arg 
290 295 300 

Glu Ser Pro Phe 
305 



(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 07 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

Met Val Asn Pro Thr Thr Ser Glu Val Gin Pro Thr Met Gly Val Lys 
! 5 10 13 

He Phe Ser Ala Gly Val Ser Ala Cys Leu Ala Asp He He Thr Phe 
20 25 30 

Pro Leu Asp Thr Ala Lys Val Arg Leu Gin He Gin Gly Glu Gly Gin 
35 40 45 
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Ala Ser Ser Thr lie Arg Tyr Lys Gly Val Leu Gly Thr lie Thr Thr 
50 55 60 

Leu Ala Lys Thr Glu Gly Leu Pro Lys Leu Tyr Ser Gly Leu Pro Ala 
65 70 75 80 

Gly He Gin Arg Gin He Ser Phe Ala Ser Leu Arg He Gly Leu Tyr 
85 90 95 

Asp Ser Val Gin Glu Tyr Phe Ser Ser Gly Arg Glu Thr Pro Ala Ser 
100 105 HO 

Leu Gly Asn Lys He Ser Ala Gly Leu Met Thr Gly Gly Val Ala Val 
115 120 125 

Phe He Gly Gin Pro Thr Glu Val Val Lys Val Arg Met Gin Ala Gin 
130 135 140 

Ser His Leu His Gly He Lys Pro Arg Tyr Thr Gly Thr Tyr Asn Ala 
145 150 155 160 

Tyr Arg Val He Ala Thr Thr Glu Ser Leu Ser Thr Leu Trp Lys Gly 
165 170 175 

Thr Thr Pro Asn Leu Met Arg Asn Val He He Asn Cys Thr Glu Leu 
180 185 190 

Val Thr Tyr Asp Leu Met Lys Gly Ala Leu Val Asn Asn Lys He Leu 
195 200 205 

Ala Asp Asp Val Pro Cys His Leu Leu Ser Ala Leu Val Ala Gly Phe 
210 215 220 

Cys Thr Thr Leu Leu Ala Ser Pro Val Asp Val Val Lys Thr Arg Phe 
225 230 235 240 

He Asn Ser Leu Pro Gly Gin Tyr Pro Ser Val Pro Ser Cys Ala Met 
245 250 255 

Ser Met Tyr Thr Lys Glu Gly Pro Thr Ala Phe Phe Lys Gly Phe Val 
260 265 270 

Ala Ser Phe Leu Arg Leu Gly Ser Trp Asn Val He Met Phe Val Cys 
275 280 285 

Phe Glu Gin Leu Lys Lys Glu Leu Met Lys Ser Arg Gin Thr Val Asp 
290 295 300 

Cys Thr Thr 
305 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 30 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 

Met Val Gly Phe Lys Ala Thr Asp Val Pro Pro Thr Ala Thr Val Lys 
1 5 10 15 

Phe Leu Gly Ala Gly Thr Ala Ala Cys He Ala Asp Leu He Thr Phe 
20 25 30 

Pro Leu Asp Thr Ala Lys Val Arg Leu Gin He Gin Gly Glu Ser Gin 
35 40 45 

Gly Leu Val Arg Thr Ala Ala Ser Ala Gin Tyr Arg Gly Val Leu Gly 
50 55 60 

Thr He Leu Thr Met Val Arg Thr Glu Gly Pro Arg Ser Leu Tyr Asn 
65 . 70 75 80 

Gly Leu Val Ala Gly Leu Gin Arg Gin Met Ser Phe Ala Ser Val Arg 
85 90 95 

He Gly Leu Tyr Asp Ser Val Lys Gin Phe Tyr Thr Lys Gly Ser Glu 
100 105 HO 

His Ala Gly He Gly Ser Arg Leu Leu Ala Gly Ser Thr Thr Gly Ala 
115 120 125 

Leu Ala Val Ala Val Ala Gin Pro Thr Asp Val Val Lys Val Arg Phe 
130 135 140 

Gin Ala Gin Ala Arg Ala Gly Gly Gly Arg Arg Tyr Gin Ser Thr Val 
145 150 155 160 

Glu Ala Tyr Lys Thr He Ala Arg Glu Glu Gly He Arg Gly Leu Trp 
165 170 175 

Lys Gly Thr Ser Pro Asn Val Ala Arg Asn Ala He Val Asn Cys Ala 
180 185 190 

Glu Leu Val Thr Tyr Asp Leu He Lys Asp Thr Leu Leu Lys Ala Asn 
195 200 205 

Leu Met Thr Asp Asp Leu Pro Cys His Phe Thr Ser Ala Phe Gly Ala 
210 215 220 

Gly Phe Cys Thr Thr Val He Ala Ser Pro Val Asp Val Val Lys Thr 
225 230 235 240 

Arg Tyr Met Asn Ser Ala Leu Gly Gin Tyr His Ser Ala Gly His Cys 
245 250 255 

Ala Leu Thr Met He Arg Lys Glu Gly Pro Arg Ala Phe Tyr Lys Gly 
260 265 270 

Phe Met Pro Ser Phe Leu Arg Leu Gly Ser Trp Asn Val Val Met Phe 
275 280 285 

Val Thr Tyr Glu Gin Leu Lys Arg Ala Leu Met Ala Ala Tyr Gin Ser 
290 295 ' * 300 

Arg Glu Ala Pro Phe 
305 
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(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: . 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GGACTCACAG GTAAGACCCC 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 
TCTCCTGCAG CCCCACCGCT 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single- 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 13 : . 
CCGCCTGCAG GTAGGTGCCC 
(2) INFORMATION FOR SEQ ID NO : 14 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
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(A) DESCRIPTION: /desc = "DNA " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Ala Thr Cys Cys Ala Gly 
± 5 10 15 

Gly Gly Gly Gly 
20 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 
' (A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ: ID NO: 15: 
GGCGCGGACA GTGAGTGACC 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 
CCCCTCCCAG ACTCCAGCCT 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 17 : 
CTGTGGAAAG GTAGGTCTGG 



20 
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(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Ala Ala Cys Thr Thr 
1 5 10 15 

Thr Gly Cyo Cyn 
20 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CTGCTCACTG GTGAGGCCCT 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid • 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20 
TCCTCTGCAG ACAACTTCCC 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 
TCTACAAGGG GTGAGCCTCC 
(2) INFORMATION FOR SEQ ID NO: 22: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 22 
TTCTTATCAG ATTTACACCC 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
GGACTACCAC CTGCTCACTG 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 



WO 98/45438 



PCT/US98/06959 



-61- 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
CCCGTAACAT ATGGACTTT 19 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic. acid 
(A) DESCRIPTION: /desc = "DNA" 

i 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 
TTCACCACGT CCACCCGGGG GGATGCCACC 30 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
AATAAA 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 403 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Arg Arg Gly His Pro lie Pro Ala Ala Thr Ser Trp Asp Gly Ala Leu 
1 5 10 - 15 

Gly Ser Pro Cys Ala Ala Pro Ala Val Ala Gly lie Thr Ala Pro Pro 
20 25 30 

Leu His Ser Pro Gly Leu Trp Ser Ser Leu Ser Pro Trp Thr Ser Ser 
35 40 45 
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Arg Pro Arg Asp Trp Ala Glu Pro Ser Arg Thr Met Val Gly Leu Lys 
50 55 60 

Pro Ser Asp Val Pro Pro Thr Met Ala Val Lys Phe Leu Gly Ala Gly 
65 70 75 80 

lie Ala Ala Cys Phe Ala Glu Leu Val Thr Phe Pro Leu Asp Thr Ala 
85 90 95 

Lys Val Arg Leu Gin lie Gin Gly Glu Asn Gin Ala Val Gin Thr Ala 
100 105 110 

Arg Leu Val Gin Tyr Arg Gly Val Leu Gly Thr lie Leu Thr Met Val 
115 120 125 

Arg Thr Glu Gly Pro Cys Ser Pro Tyr Asn Gly Leu Val Ala Gly Leu 
130 135 140 

Gin Arg Gin Met Ser Phe Ala Ser He Arg He Gly Leu Tyr Asp Ser 
145 150 155 160 

Val Lys Gin Val Tyr Thr Pro Lys Gly Ala Asp Asn Ser Ser Leu Thr 
165 17 0 1*7 5 

Thr Arg He Leu Ala Gly Cys Thr Thr Gly Ala Met Ala Val Thr Cys 
180 185 190 

Ala Gin Pro Thr Asp Val Val Lys Val Arg Phe Gin Ala Ser He His 
195 200 205 

Leu Gly Pro Ser Arg Ser Asp Arg Lys Tyr Ser Gly Thr Met Asp Ala 
210 215 220 

Tyr Arg Thr He Ala Arg Glu Glu Gly Val Arg Gly Leu Trp Lys Gly 
225 230 235 240 

Thr Leu Pro Asn He Met Arg Asn Ala He Val Asn Cys Ala Glu Val 
245 250 255 

Val Thr Tyr Asp He Leu Lys Glu Lys Leu Leu Asp Tyr His Leu Leu 
260 265 270 

Thr Asp Asn Phe Pro Cys His Phe Val Ser Ala Phe Gly Ala Gly Phe 
275 280 285 

Cys Ala Thr Val Val Ala Ser Pro Val Asp Val Val Lys Thr Arg Tyr 
290 295 300 

Met Aon Ser Pro Pro Gly Gin Tyr Phe Ser Pro Leu Asp Cys Met He 
305 310 315 320 

Lys Met Val Ala Gin Glu Gly Pro Thr Ala Phe Tyr Lys Gly Phe Thr 
325 330 335 

Pro Ser Phe Leu Arg Leu Gly Ser Trp Asn Val Val Met Phe Yal Thr 
340 345 350 

Tyr Glu Gin Leu Lys Arg Ala Leu Met Lys Val Gin Met Leu Arg Glu 
355 360 ' 365 

Ser Pro Phe Tyr Arg Gin Glu Gly His Trp Leu Thr Cys Pro Lys Pro 
370 375 380 
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Val Lys Asn Gly Arg Lys Arg Cys He His Ala His Met 1 Asp Thr Asp 
385 390 395 400 

Pro His He 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 387 amino acids 

(B) TYPE: amino acid ■ 

(C) STRANDEDNESS : unknown 
i (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 

Gly Gly Ala He Gin Ser Leu Leu Pro Pro Pro Gly Met Glu Pro Gly 
15 10 15 

Ala Pro Val Leu Pro Leu Pro Trp Gin Asp Ser Gin Pro His Arg Cys 
20 25 30 

Thr Glu Ala Gin Gly Cys Gly Ala Ala Ser Leu Leu Gly Pro Pro Leu 
35 40 45 

Gly Pro Lys Gly Thr Gly Gin Ser Leu Pro Gly Leu Trp Leu Asp Ser 
50 55 60 

Leu Gin Thr Cys Leu Pro Pro Trp Leu Ser Ser Trp Gly Gin Ala Gin 
65 70 75 80 

Gin Pro Val Leu Leu Asn Ser Leu Pro Phe His Trp Thr Gin Pro Arg 
85 90 95 

Ser Ala Cys Arg Ser Arg Gly Arg Thr Arg Arg Ser Arg Arg Pro Gly 
100 105 110 

Ser Cys Ser Thr Val Ala Cys Trp Ala Pro Ser Pro Trp Cys Gly Leu 
115 120 125 

Arg Val Pro Ala Ala Pro Thr Met Gly Trp Trp Pro Ala Cys Ser Ala 
130 135 140 

Arg Ala Ser Pro Pro Ser Ala Ser Ala Ser Met Thr Pro Ser Ser Arg 
145 150 155 160 

Cys Thr Pro Pro Lys Ala Arg Thr Thr Pro Ala Ser Leu Pro Gly Phe 
165 170 1,75 

Trp Pro Ala Ala Pro Gin Glu Pro Trp Arg Pro Val Pro Ser Pro Gin 
180 185 190 

Met Trp Arg Ser Asp Phe Arg Pro Ala Tyr Thr Ser Gly His Pro Gly 
195 200 * 205 
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Ala Thr Glu Asn Thr Ala Gly Leu.Trp Thr Pro Thr Glu Pro Ser Pro 
210 215 220 

Glv Arg Lys Glu Ser Gly Ala Cys Gly Lys Glu Leu Cys Pro Thr Ser 
225 230 235 240 

Gly Met Leu Ser Ser Thr Val Leu Arg Trp Pro Thr Thr Ser Ser Arg 
245 250 255 

Ara Ser Cys Trp Thr Thr Thr Cys Ser Leu Thr Thr Ser Pro Ala Thr 
260 265 270 

Leu Ser Leu Pro Leu Glu Pro Ala Ser Val Pro Gin Trp Trp Pro Pro 
275 280 285 

Arg Trp Thr Trp Arg Pro Gly lie Thr His Leu Gin Ala Ser Thr Ser 
290 295 300 

Ala Pro Ser Thr Val Arg Trp Trp Pro Arg Arg Ala Pro Gin Pro Ser 
305 310 315 320 

Thr Arg Asp Leu His Pro Pro Phe Cys Val Trp Asp Pro Gly Thr Trp 
325 1 330 335 

Cys Ser Pro Met Ser Ser Asn Gly Pro Lys Ser Arg Cys Tyr Gly Asn 
340 345 350 

His Arg Phe Glu Gin Asp Lys Lys Ala Thr Gly Ser Arg Val Arg Asn 
355 360 365 

Gin Leu Arg Met Glu Glu Asn Gly Ala Ser Thr His Thr Trp Thr Gin 
370 375 380 

Thr His Thr 
385 

(2) INFORMATION FOR SEQ ID NO: 29: ( 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 99 amino acids 

(B) TYPE: amino acid 

{ C ) STRANDEDNESS : unknown 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 

Glu Gly Pro Ser Asn Pro Cys Cys His Leu Leu Gly Trp Ser Pro Arg 
15 10 15 

Glu Pro Leu Cys Cys Pro Cys Arg Gly Arg Thr His Ser Pro Thr Ala 
20 25 30 

Ala Leu Lys Pro Arg Ala Val Glu Gin Pro Leu Ser Leu Asp Leu Leu 
35 40 45 

Ser Ala Leu Lys Gly Leu Gly Arg Ala Phe Gin Gin Tyr Gly Trp Thr 
50 55 60 
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Glu Ala Phe Arg Arg Ala Ser His His Gly Cys Glu Val Pro Gly Gly 
65 70 75 80 

Arg His Ser Ser Leu Phe Cys Thr Arg Tyr Leu Ser Thr Gly His Ser 
85 90 95 

Gin Gly Pro Pro Ala Asp Pro Gly Gly Glu Pro Gly Gly Pro Gin Gly 
100 105 110 

Pro Ala Arg Ala Val Pro Trp Arg Ala Gly His His Pro Asp His Gly 
115 120 125 

Ala Asp Gly Ser Leu Gin Pro Leu Gin Trp Ala Gly Gly Arg Pro Ala 
130 135 140 

Ala »Pro Asp Glu Leu Arg Leu His Pro His Arg Pro Leu Leu Arg Gin 
145 150 155 160 

Ala Gly Val His Pro Gin Arg Arg Gly Gin Leu Gin Pro His Tyr Pro 
165 170 175 

Asp Phe Gly Arg Leu His His Arg Ser His Gly Gly Asp Leu Qys Pro 
180 185 190 • 

Ala His Arg Cys Gly Glu Gly Pro lie Ser Gly Gin His Thr Pro Arg 
195 200 205 

Ala lie Gin Glu Arg Gin Lys lie Gin Arg Asp Tyr Gly Arg Leu Gin 
210 215 220 

Asn His Arg Gin Gly Gly Arg Ser Gin Gly Pro Tyr Glu Arg Asn Phe 
225 230 235 240 

Ala Gin His His Glu Glu Cys Tyr Arg Gin Leu Gly Gly Gly Asp Leu 
245 250 255 

Arg His Pro Gin Gly Glu Ala Ala Gly Leu Pro Pro Ala His Gin Leu 
260 265 270 

Pro Leu Pro Leu Cys Leu Cys Leu Trp Ser Arg Leu Leu Cys His Ser 
275 280 285 

Gly Gly Leu Pro Gly Gly Arg Gly Glu Asp Pro Val Tyr Glu Leu Thr 
290 295 300 

Ser Arg Pro Val Leu Gin Pro Pro Arg Leu Tyr Asp Lys Asp Gly Gly 
305 310 315 320 

Pro Gly Gly Pro His Ser Leu Leu Gin Gly lie Tyr Thr Leu Leu Phe 
325 330 335 

Ala Phe Gly lie Leu Glu Arg Gly Asp Val Arg Asn Leu Ala Ala Glu 
340 345 350 

Thr Gly Pro Asp Glu Ser Pro Asp Val Thr Gly lie Thr Val ^eu Asn 
355 360 365 

Lys Thr Arg Arg Pro Leu Val Ala Lys Val Ser Glu Thr Ser Glu Trp 
370 375 380 

Lys Lys Thr Val His Pro Arg Thr His Gly His Arg Pro Thr His 
385 390 395 
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(2) INFORMATION FOR SEQ ID NO: 30: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 339 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 

Arg Gly His Pro He Pro Ala Ala Thr Ser Trp Asp Gly Ala Leu Gly 
1,5 10 15 

Ser Pro Cys Ala Ala Pro Ala Val Ala Gly Leu Thr Ala Pro Pro Leu 
20 25 30 

Ser Pro Gly Leu Trp Ser Ser Leu Ser Pro Trp Thr Ser Ser Arg Pro 
35 40 45 

i 

Arg Asp Trp Ala Glu Pro Ser Arg Thr Met Val Gly Leu Lys Pro Ser 
50 55 ! 60 

Asp Val Pro Pro Thr Met Ala Val Lys Phe Leu Gly Ala Gly Thr Ala 
65 70 75 80 

Ala Cys Phe Ala Glu Leu Val Thr Phe Pro Leu Asp Thr Ala Lys Val 
85 ■! 90 95 

Arg Leu Gin He Gin Gly Glu Asn Gin Ala Val Gin Thr Ala Arg Leu 
100 105 110 

Val Gin Tyr Arg Gly Val Leu Gly Thr He Leu Thr Met Val Arg Thr 
115 120 125 

Glu Gly Pro Cys Ser Pro Tyr Asn Gly Leu Val Ala Gly Leu Gin Arg 
130 135 140 

Gin Met Ser Phe Ala Ser He Arg He Gly Leu Tyr Asp Ser Val Lys 
145 150 155 160 

Gin Val Tyr Thr Pro Lys Gly Ala Asp Asn Ser Ser Leu Thr Thr Arg 
165 170 175 

He Leu Ala Gly Cys Thr Thr Gly Ala Met Ala Val Thr Cys Ala Gin 
180 185 190 

Pro Thr Asp Val Val Lys Val Arg Phe Gin Ala Ser He His Leu Gly 
195 200 205 

Pro Ser Arg Ser Asp Arg Lys Tyr Ser Gly Thr Met Asp Ala Tyr Arg 
210 215 220 

Thr He Ala Arg Phe Glu Gly Val Arg Gly Leu Trp Lys Gly Thr Leu 
225 230 235 240 

Pro Asrf He Met Arg Asn Ala lie Val Asn Cys Ala Glu Val Val Thr 
245 250 255 
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Tyr Asp lie Leu Lys Glu Lys Leu Asp Tyr His Leu Leu Thr Asp Asn 
260 265 270 

Phe Pro Cys His Phe Val Ser Ala Phe Gly Ala Gly Phe Cys Ala Thr 
275 280 285 

Val Val Ala Ser Pro Val Asp Val Val Lys Thr Arg Tyr Met Asn Ser 
290 295 300 

Pro Pro Gly Gin Tyr Phe Ser Pro Leu Asp Cys Met lie Lys Met Val 
305 310 315 f 320 

Ala Glri Glu Gly Pro Thr Ala Phe Tyr Lys Gly Ala Ser Ser Cys Leu 
325 330 335 

Gin His Ser 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEOUENCE CHARACTERISTICS: 

(A) LENGTH: 331 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Gly Ala lie Gin Ser Leu Leu Pro Pro Pro Gly Met Glu Pro Gly Ala 
1 5 10 15 

Pro Val Leu Pro Leu Pro Trp Gin Asp Ser Gin Pro His Arg Cys lie 
20 25 30 

Glu Ala Gin Gly Cys Gly Ala Ala Ser Leu Leu Gly Pro Pro Leu Gly 
35 40 45 

Pro Lys Gly Thr Gly Gin Ser Leu Pro Gly Leu Trp Leu Asp Ser Leu 
50 55 60 

Gin Thr Cys Leu Pro Pro Trp Leu Ser Ser Trp Gly Gin Ala Gin Gin 
65 70 75 80 

Pro Val Leu Leu Asn Ser Leu Pro Phe His Trp Thr Gin Pro Arg Ser 
85 90 95 

Ala Cys Arg Ser Arg Gly Arg lie Arg Arg Ser Arg Arg Pro Gly Ser 
100 105 110 

Cys Ser Thr Val Ala Cys Trp Ala Pro Ser Pro Trp Cys Gly Leu Arg 
115 120 125 

Val Pro Ala Ala Pro Thr Met Gly Trp Trp Pro Ala Cys Ser Ala Arg 
130 135 140 

• Ala Ser Pro Pro Ser Ala Ser Ala Ser Met Thr Pro Ser Ser Arg Cys 
145 150 155 160 
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Thr Pro Pro Lys Ala Arg Thr Thr Pro Ala Ser Leu Pro Gly Phe Trp 
165 170 175 

Pro Ala Ala Pro Gin Glu Pro Trp 'Arg Pro Val Pro Ser Pro Gin Met 
180 185 190 

Trp Arg Ser Asp Phe Arg Pro Ala Tyr Thr Ser Gly His Pro Gly Ala 
195 200 205 

Thr Glu Asn Thr Ala Gly Leu Trp Thr Pro Thr Glu Pro Ser Pro Gly 
210 215 220 

Arg Lys Glu Ser Gly Ala Cys Gly:Lys Glu Leu Cys Pro Thr Ser Gly 
225 230 235 240 

Met Leu Ser Ser Thr Val Leu Arg Trp Pro Thr Thr Ser Ser Arg Arg 
245 250 255 

Ser Cys Trp Thr Thr Thr Cys Ser Leu Thr Thr Ser Pro Ala Thr Leu 
260 265 270 

Ser Leu Pro Leu Glu Pro Ala Ser Val Pro Gin Trp Trp Pro Pro Arg 
275 280 285 

Trp lie Trp Arg Pro Gly lie Thr His Leu Gin Ala Ser Thr Ser Ala 
290 295 300 

Pro Ser Thr Val Arg Trp Trp Pro Arg Arg Ala Pro Gin Pro Ser Thr 
305 310 315 320 

Arg Gly Glu Pro Pro Pro Ala Ser Ser Thr Pro 
325 330 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

. (A) LENGTH: 340 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNES S : unknown 

(D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Glu Gly Pro Ser Asn Pro Cys Cys His Leu Leu Gly Trp Ser Pro Arg 
1 5 10 15 

Glu Pro Leu Cys Cys Pro Cys Arg Gly Arg Thr His Ser Pro Thr Ala 
20 25 30 

Ala Leu Lys Pro Arg Ala Val Glu Gin Pro Leu Ser Leu Asp Leu Leu 
35 40 45 

Ser Ala 1 Leu Lys Gly Leu Gly Arg Ala Phe Gin Asp Tyr Gly Trp Thr 
50 55 60 
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Glu Ala Phe Arg Arg Ala Ser His His Gly Cys Glu Val ?ro Gly Gly 
65 70 75 80 

Arg His Ser Ser Leu Phe Cys Thr Arg Tyr Leu Ser Thr Gly His Ser 
85 90 95 

Gin Gly Pro Pro Ala Asp Pro Gly Gly Glu Pro Gly Gly Pro Asp Gly 
100 105 110 

Pro Ala Arg Ala Val Pro Trp Arg Ala Gly His His Pro Asp His Gly 
115 120 125 

Ala Asp Gly Ser Leu Gin Pro Leu Gin Trp Ala Gly Gly Arg Pro Ala 
130 135 140 

Ala Pro Asp Glu Leu Arg Leu His Pro His Arg Pro Leu Leu Arg Gin 
145 150 155 160 

Ala Gly Val His Pro Gin Arg Arg Gly Gin Leu Gin Pro His Tyr Pro 
165 170 175 

Asp Phe Gly Arg Leu His His Arg Ser His Gly Gly Asp Leu Cys Pro 
180 185 190 . 

Ala His Arg *Cys Gly Glu Gly Pro lie Ser Gly Gin His Thr Pro Arg 
195 200 205 

Ala lie Gin Glu Arg Gin Lys lie Gin Arg Asp Tyr Gly Arg Leu Gin 
210 215 220 

Asn His Arg Gin Gly Gly Arg Ser Gin Gly Pro Val Glu Arg Asn Phe 
225 230 235 240 

Ala Gin His His Glu Glu Cys Tyr Arg Gin Leu Cys Gly Gly Asp Leu 
245 250 255 

Arg His Pro Gin Gly Glu Ala Ala Gly Leu Pro Pro Ala His Cys Leu 
260 265 270 

Pro Leu Pro Leu Cys Leu Cys Leu Trp Ser Arg Leu Leu Cys His Ser 
275 280 285 

Gly Gly Leu Pro Gly Gly Arg Gly Glu Asp Pro Val Tyr Glu Leu Thr 
290 295 300 

Ser Arg Pro Val Leu Gin Pro Pro Arg Leu Tyr Asp Lys Asp Gly Gly 
305 310 315 320 

Pro Gly Gly Pro His Ser Leu Leu Cys Gly Val Ser Leu Leu Leu Pro 
325 330 335 

Pro Ala Leu Pro 
340 

(2) INFORMATION FOR SEQ ID NO: 33: 

■ x (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 397 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 
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(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Glu Thr Thr Val Asn Gly Glu Ala Arg Pro Ser Asp Pro Ala Ala Thr 
1 5 10 15 

Trp Ser Cys Ala lie Gly Trp Pro Cys Thr Thr Gin Pro Trp Leu Asp 
20 25 30 

Ala Gin Leu Pro Pro Thr Glu Ala Lys Asp Cys Gin Ala Ser Ser Leu 
35 40 45 

Leu Gly Pro Pro Ala Ala Lys Glu Pro Gly Pro Phe Pro Gly Thr Met 
50 55 60 

Val Gly Leu Gin Pro Ser Glu Val Pro Pro Thr Thr Val Val Lys Phe 
SB 70 75 80 

Leu Gly Ala Gly Thr Ala Ala Cys Phe Ala Asp Leu Leu Thr Phe Pro 
85 90 9*5 

Leu Asp Thr Ala Lys Val Arg Leu Gin lie Gin Gly Glu Asn Pro Gly 
100 105 110 

Ala Cys Ser Val Gin Tyr Arg Gly Val Leu Gly Thr lie Leu Thr Met 
115 120 125 

Val Arg Thr Glu Gly Pro Arg Ser Pro Tyr Ser Gly Leu Val Ala Gly 
130 135 140 

Leu His Arg Gin Met Ser Phe Ala Ser lie Arg He Gly Leu Tyr Asp 
145 150 155 160 

Ser Val Lys Gin Phe Tyr Thr Pro Lys Gly Ala Asp His Ser Ser Val 
165 170 175 

Ala He Arg He Leu Ala Gly Cys Thr Thr Gly Ala Met Ala Val Thr 
180 185 190 

Cys Ala Gin Pro Thr Asp Val Val Lys Val Arg Phe Gin Ala Met He 
195 200 205 

Arg Leu Gly Thr Gly Gly Glu Arg Lys Tyr Arg Gly Thr Met Asp Ala 
210 215 220 

Tyr Arg Thr He Ala Arg Glu Glu Gly Val Arg Gly Leu Trp Lys Gly 
225 230 235 240 

Thr Trp Pro Asn He Thr Arg Asn Ala He Val Asn Cys Ala Glu Met 
245 250 255 

Val Thr Tyr Asp He He Lys Glu Lys Leu Leu Glu Ser His Leu Phe 
260 265 270 

Thr Asp Asn Phe Pro Cys His Phe Val Ser Ala Phe Gly Ala Gly Phe 
275 280 285 . 

Cys Ala Thr Val Val Ala Ser Pro Val Asp Val Val Lys Thr Arg Tyr 
290 295 300 
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Met Asn Ala Pro Leu Gly Arg Tyr Arg Ser Pro Leu His Cys Met Leu 
305 310 315 320 

Lys Met Val Ala Gin Glu Gly Pro Thr Ala Phe Tyr Lys Gly Phe Val 
325 330 335 

Pro Ser Phe Leu Arg Leu Gly Ala Trp Asn Val Met Met Phe Val Thr 
340 345 350 

Tyr Glu Gin Leu Lys Arg Ala Leu Met Lys Val Gin Val Leu Arg Glu 
355 360 365 

Ser Pro Phe Thr Arg Gin Ala Gly Cys Leu Glu Gin Asn Lys Ala Ser 
370 375 380 

Leu Pro Gly Thr Gin Ala His . Thr Ser Glu Pro Cys Thr 
385 390 395 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 81 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 

Arg Gin Gin Met Val Arg Pro Gly Arg Gin lie Leu Leu Leu Pro Asn 
15 10 15 

Gly Val Glu Pro Gly Gly Pro Ala Leu Pro Asn Leu Gly Thr His Ser 
20 25 30 

Phe Leu Pro Glu Leu Lys Gin Lys lie Ala Arg Gin Ala Leu Ser Ser 
35 40 45 

Asp Leu His Arg Gin Gin Arg Asn Gin Ala His Ser Pro Gly Pro Trp 
50 55 60 

Leu Asp Phe Ser Pro Pro Lys Cys Leu Pro Gin Arg Leu Ser Ser Trp 
65 70 75 80 

Gly Pro Ala Leu Arg Pro Val Leu Arg Thr Ser Ser Leu Phe Pro Trp 
85 90 95 

Thr Pro Pro Arg Ser Val Cys Arg Ser Lys Gly Arg Thr Gin Gly Leu 
100 105 110 

Arg Ala Cys Ser Thr Ala Val Cys Trp Val Pro Ser Leu Trp Cys Ala 
115 120 125 

Asp Arg Val Pro Ala Ala Pro Thr Ala Asp Trp Ser Leu Ala Cys Thr 
130 135 140 

Ala Arg Val Leu Pro Pro Phe Glu Leu Ala Ser Thr Thr lie Ser Ser 
145 150 155 160 
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Ser Ser Thr Pro Pro Arg Glu Arg Thr Thr Pro Ala Ser Pro Ser Gly 
165 170 175 

Phe Trp Gin Ala Ala Arg Gin Glu Pro Trp Gin Pro Ala Pro Ser £ro 
180 185 190 

Arg Met Trp Arg Ser Asp Phe Lys Pro Tyr Ala Trp Glu Leu Glu Glu 
195 200 205 

Aro Gly Asn Thr Glu Gly Leu Trp Met Pro Thr Glu Pro Ser Pro Gly 
210 215 220 

Arg Lys Glu Ser Gly Ala Cys Gly Lys Gly Leu Gly Pro Thr Ser Gin 
225 230 235 240 

Glu Met Pro Leu Ser Thr Val Leu Arg Trp Pro Thr Thr Ser Ser Arg 
245 250 255 

Arg Ser Cys Trp Ser Leu Thr Cys Leu Leu Thr Thr Ser Pro Val Thr 
260 265 270 

Leu Ser Leu Pro Leu Glu Leu Ala Ser Val Pro Gin Trp Trp P s ro Pro 
275 280 285 

Arg Trp Met Trp Arg Pro Asp Thr Thr Leu Pro Ala Gly Thr Ala Ala 
290 295 300 

Leu Cys Thr Val Cys Arg Trp Trp Leu Arg Arg Asp Pro Arg Pro Ser 
305 310 315 320 

Thr Lys Asp Leu Cys Pro Pro Phe Cys Val Trp Glu Leu Gly Thr Cys 
325 330 335 

Leu His Met Ser Asn Arg Gly Pro Lys Ser Arg Tyr Cys Gly Asn Leu 
340 ' 345 350 

Arg Phe Glu Gin Gly Lys Gin Ala Ala Trp Asn Arg Thr Lys Arg Leu 
355 360 365 

Cys Leu Gly His Arg Pro Thr Arg Gin Asn Arg Ala Arg 
370 r 375 • 380 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 97 amino acids 

(B) * TYPE : amino acid 

(C) STRANDEDNESS : unknown 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

Arg Asp Asn Ser Glu Trp Gly Pro Ala Val Arg Ser Cys Cys Tyr Leu 
1 .5 10 15 

Met Glu Trp Ser Leu Arg Val Ala Leu His Tyr Pro Thr Leu Ala Arg 
20 25 30 
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Arg Thr Ala Ser Ser Leu Asn Ser Lys Arg Leu Pro Gly -Lys Leu Ser 
35 40 45 

Pro Arg Thr Ser lie Gly Ser Lys Gly Thr Arg Pro lie Pro Arg Asp 
50 55 60 

His Gly Trp Thr Ser Ala Leu Arg Ser Ala Ser His Asn Gly Cys Glu 
65 70 75 80 

Val Pro Gly Gly Arg His Cys Gly Leu Phe Cys Gly Pro Pro His Phe 
85 90 95 

Ser Pro Gly His Arg Gin Gly Pro Ser Ala Asp Pro Arg Gly Glu Pro 
100 105 110 

Arg Gly Ser Glu Arg Ala Val Pro Arg Cys Ala Gly Tyr His Pro Asp 
115 120 125 

Tyr Gly Ala His Arg Gly Ser Pro Gin Pro Leu Gin Arg Thr Gly Arg 
130 135 140 

Trp Pro Ala Pro Pro Asp Glu Phe Cys Leu Met Ser Asn Trp Pro Leu 
145 150 155 ' 160 

Arg Leu Cys Gin Ala Val Leu His Pro Gin Gly Ser Gly Pro Leu Gin 
165 170 175 

Arg Arg His Gin Asp Ser Gly Arg Leu His Asp Arg Ser His Gly Ser 
180 185 190 

Asp Leu Arg Pro Ala His Gly Cys Gly Glu Gly Pro lie Ser Ser His 
195 200 205 

Asp Thr Pro Gly Asn Trp Arg Arg Glu Glu lie Gin Arg Asp Tyr Gly 
210 215 220 

Cys Leu Gin Asn His Arg Gin Gly Gly Arg Ser Gin Gly Pro Val Glu 
225 230 235 240 

Arg Asp Leu Ala Gin His His Lys Lys Cys His Cys Gin Leu Cys Asp 
245 250 255 

Gly Asp Leu Arg His His Gin Gly Glu Val Ala Gly Val Ser Pro Val 
260 265 270 

Tyr Gin Leu Pro Leu Ser Leu Cys Leu Cys Leu Trp Ser Trp Leu Leu 
275 280 285 

Cys His Ser Gly Gly Leu Pro Gly Gly Cys Gly Lys Asp Pro lie His 
290 295 300 

Glu Arg Ser Pro Arg Gin Val Pro Gin Pro Ser Ala Leu Tyr Ala Glu 
305 310 315 320 

Asp Gly Gly Ser Gly Gly Thr His Gly Leu Leu Gin Arg lie Cys Ala 
325 330 335 

Leu Leu Ser Ala Ser Gly Ser Leu Glu Arg Asp Asp Val Cys Asn lie 
340 345 350 

Ala Thr Glu Glu Gly Leu Asn Glu Ser Pro Gly Thr Ala Gly lie Ser 
355 360 365 
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Val Leu Asn Lys Ala Ser Arg Leu Pro Gly Thr Glu Gin Ser Val Ser 
370 375 380 

Ala Trp Asp Thr Gly Pro His Val Arg Thr Val His Ala 
385 390 395 
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CLAIMS 

What is claimed: 

1. Isolated or recombinant nucleic acid which encodes a 
mammalian uncoupling protein 3. 

5 2. The nucleic acid of Claim 1 wherein the uncoupling 
protein 3 is human. 

3 . The nucleic acid of Claim 1 selected from the group 
consisting of: SEQ ID NO: 1, SEQ ID NO: 2 arjd SEQ ID 
NO: 7 . 

10 4. The nucleic acid of Claim 1 wherein said nucleic acid 
hybridizes under stringent conditions with DNA 
selected from the group consisting of: SEQ ID NO: 1, 
the complement of SEQ ID NO : 1 , SEQ ID NO: 2 the 
complement of SEQ ID NO: 2, SEQ ID NO: 7 and the 

15 complement of SEQ ID NO: 7. 

5. The nucleic acid of Claim 1 wherein the nucleic acid 

encodes an amino acid sequence selected from the group 
consisting of: SEQ ID NO: 3, SEQ ID NO: 4, and SEQ ID 
NO . 8 . 

20 6. A recombinant nucleic acid construct comprising the 
nucleic acid of Claim 1. 

7 . The recombinant nucleic acid construct of Claim 6 

wherein the nucleic acid is selected from the group 
consisting of: SEQ ID NO: 1, SEQ ID NO: 2 and SEQ ID 
25 NO: 7. 



WO 98/45438 



-76- 



PCT/US98/06959 



8 . The recombinant nucleic acid construct of Claim 6 
wherein the nucleic acid encodes the amino acid 
sequence selected from the group consisting of: 'SEQ ID 
NO: 3, SEQ ID NO 4 , and SEQ ID NO: 8. 

5 9. The recombinant nucleic acid construct of Claim 6 
wherein the nucleic acid is operably linked 'to an 
expression control sequence. 

10. A host cell comprising the nucleic acid of Claim 1. 

11. The host cell of Claim 10 wherein the nucleic acid is 
10 operably linked to an expression control sequence, 

whereby mammalian uncoupling protein 3 is expressed 
when the host cell is maintained under conditions 
suitable for expression. 

12. A method for producing a mammalian uncoupling protein 
15, 3 comprising: 

a) introducing into a host cell a nucleic acid 
construct comprising a nucleic acid which encodes 
a mammalian uncoupling protein 3; and 

b) maintaining the host cells produced in step a) 
20 under conditions whereby the nucleic acid is 

expressed and the mammalian uncopling protein 3 
is produced. 



13 . An antibody or functional portion thereof which binds 
mammalian uncoupling protein 3. 

25 14. A method of detecting mammalian uncoupling protein 3 
in a sample comprising: 

a) . contacting a sample with an antibody which binds 
uncoupling protein 3, under conditions suitable 



WO 98/45438 



PCTAJS98/06959 



-77- 

for specific binding of said antibody to the 
mammalian uncoupling protein 3; and 

b) detecting an antibody-mammalian uncoupling 
protein 3 complex, 
5 wherein if the antibody-mammalian uncoupling protein 

complex is detected, mammalian uncoupling protein 3 is 

present in the sample. 

15. A method of identifying an agent which alters 
uncoupling protein 3 activity comprising the steps of: 

10 a) introducing into a host cell a nucleic acid 

construct comprising a nucleic acid which encodes 
a mammalian uncoupling protein 3; ■ 

b) maintaining the host cells produced in step a) 
under conditions appropriate for expression of 

15 the nucleic acid; 

c) contacting the cells of b) with the agent; and 

d) detecting mitochondrial electrical potential of 
the cells of c) in the presence of the agent, 

wherein a change in mitochondrial electrical potential 

2 0 in the presence of the agent indicates that the agent 

alters uncoupling protein 3 activity. 

16. The method of Claim 15 wherein the mitochondrial 
electrical potential is detected using fluorescence. 

17. A method of identifying an agent which is an activator 
25 of uncoupling protien 3 activity comprising the steps 

of: 

a) introducing into a host cell a nucleic acid 

construct comprising a nucleic acid which encodes 
a mammalian uncoupling protein 3; 

3 0 b) maintaining the host cells produced in step a) 

under conditions appropriate for expression of 
the nucleic acid; 
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c) contacting the cells of b) with the agent; and 

d) detecting mitochondrial electrical potential of 
the cells of c) in the presence of the agent; 

wherein a reduction in mitochondrial electrical 
5 potential in the presence of the agent indicates that 

the agent is an activator uncoupling protein 3 
activity. 

18. The method of Claim 17 wherein the mitochondrial 

electrical potential is detected using fluorescence. 

10 19. A method of identifying an agent which is an inhibitor 
of uncoupling protein 3 activity comprising Ithe steps 
of: 

a) introducing into a host cell a nucleic acid 
construct comprising a nucleic acid which encodes 

15 a mammalian uncoupling protein 3; 

b) maintaining the host cells produced in step a) 
under conditions appropriate for expression of 
the nucleic acid; 

c) contacting the cells of b) with the agent; and 
20 d) detecting mitochondrial electrical potential of 

the cells in the presence of the agent; 
wherein an increase in mitochondrial electrical 
potential in the presence of the agent indicates that 
the agent is an inhibitor uncoupling protein 3 
25 activity. 

20. The method of Claim 19 wherein the mitochondrial 

electrical potential is detected using fluorescence. 



30 



21. A method of inhibiting protein catabolism in a mammal 
comprising administering to the mammal an effective 
amount of an inhibitor of uncoupling protein 3 . 
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22 . A method of enhancing protein catabolism in a mammal 
comprising adminstering to the mammal an effective 
amount of an enhancer of uncoupling protein 3 . 

23. A method of inhibiting muscle wasting in a mammal 
5 comprising adminstering to the mammal an effective 

amount of an inhibitor of uncoupling protein 3 . 

24 . Use of an inhibitor of uncoupling protein 3 in a 
method of inhibiting protein catabolism in a mammal, 
wherein the method comprises administering to the 

10 mammal an effective amount of an inhibitor of 

uncoupling protein 3 . 

25. Use of an enhancer of uncoupling protein 3 in a method 
of enhancing protein catabolism in a mammal, wherein 
the method comprises administering to the mammal an 

15 effective amount of an enhancer of uncoupling protein 

3, 

26. Use of an inhibitor of uncoupling protein 3 in a 
method of inhibiting protein catabolism in a mammal, 
wherein the method comprises administering to the 

2 0 mammal an effective amount of an inhibitor of 

uncoupling protein 3 . 
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Vanfll I Bin I BfiQ 1 

t : J 

ACCACCGCCCATCCAATCCCTCCTCCCACCTCCTCCCATCCACCCCTAROGAGCCCCTCTGCTGCCCCTGCCCTSnCAnftAf.TrACACCCCCACCCCTCC 

) ■ 1 ' ! ■ 1 I 1 I 1 1 1 1 100 

TCCTCCCCCCrACSTTACCCACCACCCTCSACCACCCTACCTCCCCATCCCTCCCCCACACCACCUUUACUtiCACCSTCCTCACTCTCCQCOTCQCOACO 

RRCHPIPAATSWOCALCSPCAAPAVARI 7 A P P L 
CCA1QSLLPPPCHSP. CAPVLPLPWOOSOPHRC 
ECP3NPCCHLtCWSf'*CPLXCPCRaRTM3PTAA 

I 1 1 ' ' ' ' 



BstXl Eco57l BsbRI 

! I ! 

ACTGAAGCCCAGGGCTaTGCAGCAGCCTCTCTCCTTGGACCTCCTnTCftftCCCTAAACCRACTCCCCACACCCTTtCACCACT^Td^TTCCACTCAACCC 
TGACT7CCC6TCCCCACACCTCCTCGCAGACAC5AACCTGCAGCACA«CCCUCATT7CCC7CACCCCTCTCCCAAGCTCCTCATACCAACCTQACTTCC0 

H . SPGLW'SSLSPVISSRP.. ROWAEPSKTHVOLKP 
TEAOOCGAASLLGPPUGPKGTCQSLPCLWLD.S 
L K P RAV E0PLS L0 LL3 AI KCLCR AFO OTCWTEA 

5C057I 

: i 

7TCACACCTGCCTCCCACCATGGCTCTGAAGTTCCTGGGCGCAGGCACAGCAGCCTRTTTTGCTGAACTCGTTACr.TTTCCACTGGACACAf;CCAACCTC 

♦ '- t-w^—H ""^ i I ■ I i ! i i | i i 1 i 1 ' | 300 

AACTCTCCACCGAGGCTGGTACCGACACTTCAAGGACCCCCCTCCGTCTCCTCCGACAAAACCAC) I GAGCAA I SGAAAUCTCACCTCTSTCCCTTCt AC 

3DVPP7MAVKFLGAG.1AACFAELVTFPLD7AKV 
LOTCLPPWL. SSVGOA QQPVLLNSLPFHWT0PR3 
FARASHH'CCEVPCCRHSSLFC. TRYLSTGHSQG 



Xcml EnmllOSI 
fOtl Bsgl j Ncol EriHKI 

CCCCTGCACATCCACCCCGACAACCACCCCCTCCACACGGCCCGGCTCGTGCAGTACCGTGGCGTCCTGGGCACCATCCTCACCATGGTCCGGACTGAGG 

1 1 1 * HOO 

CCCCACGTCTACGTCCCCCTC7TGCTCCCCCACCTCTCCCCCCCCCAGCACGTCA1GGCACCCCACCACCCC7CCTACCACTCCTACCACCCC7CACTCC 

KLQlQGENQAVOTARLVOYROVt.OTrLTf.VRTC 
ACRSRCR7RRSRRPC3C37VACWAP3. PVCGLR 
PPADHCGtPGGPOGPARAVPVRACMHPDHOAO. G 

flgoAlV BspMI 
I Naet i Ttn l Bspi407t 

p«tl j j pel p«i jjrmini B»t<5l 

GTCCC7GCA'gCCCCTACAA75GGC7GG7GGCCGGCC7GCAGCGCCAGATGAGCT7CGCC7CCATCCSCA7CGGCC7C7A7GAC7CCC7CAAGCACC7CTA 

■ I I ' i ■ I 1 1 i I 600 

CAGGGACB7CGGGGA7G7TACCCGACCACCCCCCCGACGTCGuGGTC7ACTC6AAGCGGAG67AGGCG7AGCCGGAGA7AC7GAGGCAG77CCTCCACA7 

r 

SPCSrYNGLVAGLQRQriSFASIftlCLYOaVKQVY 
VPAAP7MCWWPACSAR. ASPPSASASf17PS8«C 
SLQPLOWACGRP^APOE I R L H P H R P L . L R Q A G 

Bpml NgoAiv Xcml 

psul ( . Bsgl \ Naei ) ; Ncol BstEU JBstXl 

CACCCCC , AAAGGCGCCGACAACTCCACCCTCAC7ACCCCCATT77SGCC5SCTGCACCACAGGAGCCA7GGCGG7GACC7GTCCCCAGCCCACACA7C7C 

_H i i 1 I t . 1 1 ■ I t i I I 1 600 

G7GGGGGT7TCCGCGCC'rCTTCACGTCGGACTCATGCCCC7AAAACCGGCCGACGTGGTGTCCTCCGTACCGCCACTGGACACCGGTCGGG7GTC7ACAC 

TPK GAONSSL 7 7R I L.AGC T 7 G 4 M A V T CAOPTOV 
TPPKARTTPA5UPGFWPAAPQEPWR. PVPSPOHV 
HPGRRGOLOPHYPOFCRLHHRSHGGOLCPAHRC 
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peal' peg I 

GTQAACCTCCGATTTCACCCCACCATACACCTCCCGCCAf CCAGGAUCGACAGAAAAt ACAGCGGGACTATGGACGCCTACAGAACCATCGCCACGGAGG 

I I ■ I ' I ' ■ 1 j i ■ > 700 

CACTTCCAGGCTAAAGTCCGGTCGrATGTGGAGCCCGGTAGGTCCTCGCTCTCTTTTATSTCGCCCTGATACCTCCGCATGTCrTGGTAGCGGTCCCTCC 

VKVRFOA5(HLGPSR30RICYSGTf1DAYRT[ARE 
. R8D?RPAYT6GHPGAT£HTA0LWTPTE°SPCR 
G E 0 P I SGOHTPRAIQEROK IQROYCRLONHROUG 



p« I JBsm I pot£ It 



BspH I Bum i 

aaccactcaggcccctctccaaaggaactttgcccaacatcatgaggaatg^ 
ttcctcagtccccgcacacctttccttgaaacgggttgtagtactccttacoatagcacttgacacgactccaccactggatqctqtagqagttcctctt 

EcVRCLWKGTLPh IHRNAiVNCAEYVTYDtLXEK 
KESCACGKELCPTS. GHISSTVLRW. P T T 8 8 R ft 
RSQGPY ERNFA. OHHEECYROLC. CCOLftHPQGE 



eoo 



NgOAiV 



f- r M ! Not I Paixr 

GCTCCTGGACTACCACCTGCTCACTCACAACTTCCCCTCCCACTTTCTCTCTCCCTTTCaAOCCCQCTTCTCTOCCACACTaOTOQCCTCCCCCCTCCAC. 

— 1 1 900 

CaACGACCTaATCCTCCACQAGTCACTQrTGAAGGGGACGaTCAAACAQAQACCGAAACCTCGCCCCAACACACCCTCTCACCACCCCACCCCCCACCTG 

LLOtHLLT0NFPCHFYSAFQAQFCATVVA3PY0 
3CVTTTCS:. TTSPATL SLPLEPASVPOWWPPRWT 
A A 0 L P P A H , 0 L P L ^ t. C L C L W 3 R L L C H 5 G G L P G G 

Bbsl 
Basil 

Bpm 1 Bsp120 1 

Gsui EcoSTI .Seal ! Apal 

i i \ r 

GTCCTCAACACCCCCSTATA7GAACTCACC7CCACGCCAGTACTTCACCCCCCTCCACIC f Al UA I AAAGATCGTCGCCCAGGACGGCCCCACACCCTTCT 

■ * 1 ■ > ■ ^ ■ I ■ » 1 ■ 'l i i i i i . ■ I 1000 

CACCACTTCTCCCCCATATACTTGAGTGGAGGTCCGGTCATGAAGTCGGGGGAGCTGACATACTATTTCrACCACCGG&TCCTCCCGGGQTCTCGGAAGA 

VVKTRYHN3PPG0YF3PL0CniKrtVAOE6PTAF 
W. RPCt. THLQAST3APSTV. . RWVPRRAP0P3 

R c . £ ? p .. v Y . c L T .. 8 .. R . p T L 0 p . p R L . T 0 * P G c p . c c 9 s L \ 

Bop120l 

pamHI ^vaiU j Apal 

ACAAGG0ATTTACACCCTCCTT7TTGCGTTTGCCATCCTGGAACCTCCTCATCTTCCTAACCTATCAGCACCI5AAACCCCCCCTCATGAAAG7CCAGAT 

■ i ' ■ ' ■ I * ■ ' ■ i ' ' * l 1 ■ I '!■ | . I UOO 

TCT7CCCTAAATCTCCGACGAAAAACGCAAACCCTAGGACCTTGCACCACTACAAGCAT7GGA7ACTCGTCGACTTTGCCCGGGACTACTTTCACG7CTA 

Y K G F TPSF LRLG5VN Y VMFVTYEQLK R ALHKVOH 
TROLHPPFCVWOPGTV. CS. PHSS. NOP. . KSR 
QG ! YTLLFAFG t LERCOVRNL. AAETOPQESPO 

GTTACGCCAATCACCCTTTgCA^CAACACAAGAACCCCACTGGTAGCTAACCTGTCCCAAACCAG7TAAGAATGCAAGAAAACCGTCCATCCACCCACAC 



CAATCCCCTTACTCCCAAAACT7GT7CTCTTCTTCCC3TCACCATCGATTGCACAGCCT7TCCTCAAT7CTTACCT7CT77TGCCAC07ACG7CCGTCTG 

.URESPF. TR QECHW, L.TCPK<»VKNGRKRC1HAH 
CYG NHRFE0DKKATG5. RVRN0LRMEENGA57HT 
VTC 1 TVLNK7RRPLVAKV3ETS. EVKK7VHPRT 
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S 46CGCCCXTC CAATCCCTICTBCCACCTCCrC00ATCCACCCCTACCCACCCCCTCTCeTgC |0Q 
CrCCCCSCTAaCTTACCCACGACGQTCWCCACCCUCCTCCCQATCCCTCCGCSACACOACGOCCACGCCACCCTCC^ 

« « li o i daaTSWDCALGSPCAAPAVACLTAPPL 
!• R o a "r P 0 6 L L P P P G M E P . C A P V L P U P W Q 0 S 0 P H R C 
E 6 P 3 N P C C M L L C V % P R E P I C C P C R Q R T M P T A A 

QstXt E00671 jB«aRI |Eoo5£U^ 
C TCAAGCCCACQCCTGTQGAQCAGCCTCTCTCCnCCA^ ^ 
aACTTCGaQTCCCCACACCTCCTCCCACACAOGA^ 

i sPQ LWGSLGPHfSSRP. R0WAEP5RT In V 6 I K P. 
. * r * « e r e A A fl L L 0 P P L G P K 0 T C 0 S I P C LULU. S 
\ Z K » R A C V E 0 P L 6 L 0, L L 3 A L * C L C R A F 0 P T G V T E A, 

ECOS7I 

TTCAQAC GTCCC7CCCACCATCCCTQTCAACTTCCTGGG0CCACCCACACCACCCTCTTTTGCTCAACTK 30Q 
iwOTCTCCACCaASGCTCGTACCCA^ 

0 r, » p p t MAVK FLGAOTAAC P A E 1 V T F P L D T A K V " 

> u i L L P H W I . — sSUC&AaqPVLLH SLfr FHW 1 u . P * 
*F "ft R A S H H C C C V P C C R H S S L f C . T RTLST GH800 



EamllOSI 

IK I 



Xcml EanV 

CGCCTGCAGATCCACCGCCAGAACCACCCCQTCCAGACCC^^ ftQQ 
CKCWOTCTABaTCCCCCTCTTM^ 

~~S i 5 i 5 0 E N n k V 0 T A R L V 0 T R G VUG T I I T H V W T E. - 

R a l k 5 r ii k i K k s r w p c c a I V A C U A H l . p» w ti u L K 
P P A 0 % 0 G E P G 0 P D C P A R. A V P W R A C H H P, D H C A D . C 



prtl J J Feel ptl JjrtMltl p«GI 

QTCCCTC CAOCCCCTACAATCCGCTCGrCCTOOCCTCCAGCGCCACATGAOCTTO^ BOO 
CACCCA=OTCCGGCATaTTACCCCACCACCGGCCCCACaTCGCCGTCTACTCCAACCCCA0OTACCCGTAGCCCCAGATACTCACCCACTTCCTCCACAT 

"7 p F sT Y N C L v ftQi nnnHBFAS 1 R 1 Q L Y H S V~* O v T 

B L O P L- 0 W A G C R P A A P D E L H L H P H R P L ■ L H O A C Y 

SB?. 1 b 30 » fpff pp~i p«» f«» 

CAC CCCCAAAGGCGCCOACAACTCCAGCCTCACTACCCG^ 60Q 
GTGGOGGTTTCCCCGCCTGT7CAGQTCCQACTGATGCGCCTAAAACCCCCCCACCTCC7CTCCTCCCTACCGCCACTCCACACGCCTCCCGTGTCTACAC 



T P K » * n * * * T T R 1 L A C C T T C A H A V T C * 0 iZjL 

-r P P K-' A S T T P A S L P G F W P A A P Q t P « « . V „ r „ » ^ u Q e 

H P Q R ' R C OLOPMYPOFGRLHK RSH CGDLCP-A WR 
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ninAi pool* pc3i 

STCAACCTCCflATTTCAGCCCAGCATACACCTCCCQ^ 7Q0 
CACTTCCACCCTAAAQTCCCCTCOrATGTCCACCtCCCTACCTCCTCCCTOTCTTTTATGTC&CCCTGATACCrCCOOA 



„ , u ur n a ** l H i G P 8 R S O * « « t h n ft Y ft T 1 A R F - 

! 11 J J I ft P A Y T 5 O H P fi A I k fi I A U L W T P T E P 3 P 0 R 

0 EC P I S C 0 H T P R A 1 0 E R 0 K 10 B 0 Y C R LOW H R 0 Q G 

AAQOAQTCACSCOCCTGTGGAAACCAACTTTOCCCAACATCATCASCAATGCTATCttTCAACTCTC 60Q 
TTCCTCACTCCCCGGACACCTTTCCTTCAAACCGGTTGTAGTACTCCTTAro 

- \ \ \ \ \ \ \ \ \ i l 1 1 v . '» H . i «. v '» Y * w- Vv XvX'XW " 

R 6 0 S P V E R W f A 0 H H E E C T R a L C ■ C C D L R H P 0 C E 

Nooaiv 

GCTCCICCA CTACCACCTCCTCACTGACJUCTTCeCCTB«ACTTW fiQ0 
CGAC0ACC"ICATCCTGGACGAQTGACTGTTGAACG5CACC0TGAAACAQA3AC0GAAACCTC0GCCGAACACACCCTGTCACCACCC0AOGCCCCAtCTG 

' "" ' HY H L L T 0 N F ^_^^ _ C u ^ * P A R F C A T- V V A 3 P V 0 

5 c w I I I ii i I t r S P A \ ^ S L P L fc .P A 6 V P a W W ^ P « ' 

A A G L P P A H . G L P I P L C L C L V B R L I C H 3 G C I P 0 C . 

Bbii 

Igli TO, 

CTCCTGAACACCCPQTATATCAACTCACCTCCAGCCCA^TACTTCASCCCCCTCGACTC ^ 



CACCACTTCTGCCCCATATACTTGAOTOGAGGTCCCtrrCATGAAOTCCCCGQAGCTOACATACTATTTCTACCACCCGCTCCTCCCCCCCTGrCOGAACA 

y y k T R Y w N S P P C Q Y F S P L 0 C H 1 K H V A 0 E C P T A F __ 
U . ft P CI* T H L fl A 1 L A F 5 \ V . . K M W p H ft a *■ a r ^~ 
G E 0 PVYEL7SRPVL0P PRLVOIC D C C P C C P H S L L 

BflftfU 

SOTCAgC 



ACAACCSOTCAfccCTCCTCCTCCCTCCAQCACTCCC 

i-r i» 1038 

TCTTCCCCACTCCGAQGAGGACGGAGCTCCTQAGGO 

V K C- , \ A 5 S C L 0 H 8 
TRGE-PPPASSTP 
C5VSLLLPPALP 
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